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VIRAL AGEI^T 

The present invention relates to the isolation and characterisation of 
the viral agent responsible for post-transfusional non-A non-B 
hepatitis (PT-NANBH) and in particular to PT-NANBH viral polypeptides, 
DNA sequences encoding such viral polypeptides, expression vectors 
containing such DNA sequences, and host cells transformed by such 
expression vectors. The present invention also relates to the use of 
a DNA sequence in a nucleic acid hybridisation assay for the diagnosis 
of PT-NANBH. The present invention further relates to the use of 
PT-NANBH viral polypeptides or polyclonal or monoclonal antibodies 
against such polypeptides in an immunoassay for the diagnosis of 
PT-NANBH or in a vaccine for its prevention, 

Non-A non-B hepatitis (NANBH) is by definition a diagnosis of 
exclusion and has generally been employed to describe cases of viral 
hepatitis infection in human beings that are not due to hepatitis A or 
B viruses. In the majority of such cases, the cause of the infection 
has not been identified, although, on clinical and epidemiological 
grounds, a number of agents have been thought to be responsible as 
reviewed in Shih et al ( Prog. Liver Pis , . 1986, 8, 433-452). In the 
USA alone, up to 10% of blood transfusions can result in NANBH which 
makes it a significant problem. Even for PT-NANBH there may be at 
least several viral agents responsible for the infection and over the 
years many claims have been made for the identification of the agent, 
none of which has been substantiated. 

European Patent Application 88310922.5 purports to describe the 
isolation and characterisation of the aetiological agent responsible 
for PT-NANBH which is also referred to in the application as hepatitis 
C virus (HCV) . A cDNA library was prepared from viral nucleic acid 
obtained from a chimpanzee infected with PT-NANBH and was screened 
using human antisera. A number of positive clones were isolated and 
sequenced. The resulting nucleic acid and amino acid sequence data, 
which are described in the application, represent approximately 70% of 
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the lOkb viral genome and are derived entirely from its 3' -end 
corresponding to the non- structural coding region. 

The present inventors have now isolated and characterised PT-NANBH 
viral polypeptides by the cloning and expression of DNA sequences 
encoding such viral polypeptides. Surprisingly, the nucleic acid and 
amino acid sequence data both show considerable differences with the 
corresponding data reported in European Patent Application 88310922,5. 
Overall these differences amount to about 20% at the nucleic acid 
level and about 15% at the amino acid level but some regions of the 
sequences show even greater differences. The overall level of 
difference is much larger than would be expected for two isolates of 
the same virus even allowing for geographical factors, and is believed 
to be due to one of two possible reasons. 

Firstly, the present inventors and those of the aforementioned 
European Patent Application used different sources for the nucleic 
acid used in the cDNA cloning. In particular, the European Patent 
Application describes the use of chimpanzee plasma as the source for 
the viral nucleic acid starting material, with the virus having been 
passaged through a chimpanzee on two occasions, PT-NANBH is of course 
an human disease and passaging the virus through a foreign host, even 
if it is a close relative to hxaraans, is likely to result in extensive 
mutation of the viral nucleic acid. Accordingly, the sequence data 
contained in European Patent Application 88310922.5 may not be truly 
representative of the actual viral agent responsible for PT-NANBH in 
humans. In contrast, the present inventors utilised viral nucleic 
acid from a human plasma source as the starting material for cDNA 
cloning. The sequence data thus obtained is much more likely to 
correspond to the native nucleic acid and amino acid sequences of 
PT-NANBH. 

Secondly, it may be that the viral agent exists as more than one 
subtype and the sequence data described in the European Patent 
Application and that elucidated by the present inventors correspond to 
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separate and distinct subtypes of the same viral agent. 
Alternatively, it may be that the level of difference between the two 
sets of sequence data is due to a combination of these two factors. 

The present invention provides a PT-NANBH vipal polypeptide comprising 
an antigen having an amino acid sequence that is at least 90% 
homologous with the amino acid sequence set forth in SEQ ID NO: 3,4,5, 
18,19,20,21 or 22, or is an antigenic fragment thereof. 

SEQ ID NO : 3,4,5,18,19,20,21 or 22 set forth the amino acid sequence 
as deduced from the nucleic acid sequence. Preferably, the amino acid 
sequence is at least 95% or even 98% homologous with the amino acid 
sequence set forth in SEQ ID NO : 3,4.5.18.19.20,21 or 22. 
Optionally, the antigen may be fused to an heterologous polypeptide. 

Two or more antigens may optionally be used together either in 
combination or fused as a single polypeptide. The use of two or more 
antigens in this way in a diagnostic assay provides more reliable 
results in the use of the assay in blood screening for PT-NANBH virus. 
Preferably, one antigen is obtained from the structural coding region 
(the 5' -end) and one other antigen is obtained from the non- structural 
coding region (the 3 '-end). It is particularly preferred that the 
antigens are fused together as a recombinant polypeptide. This latter 
approach offers a number of advantages in that the individual antigens 
can be combined in a fixed, pre -determined ratio (usually equimolar) 
and only a single polypeptide needs to be produced, purified and 
characterised. 

An antigenic fragment of an antigen having an amino acid sequence that 
is at least 90% homologous with that set forth in SEQ ID NO : 3,4,5, 
18,19,20,21 or 22 preferably contains a minimum of five, six, seven, 
eight, nine or ten. fifteen, twenty, thirty, forty or fifty amino 
acids. The antigenic sites of such antigens may be identified using 
standard procedures. These may involve fragmentation of the 
polypeptide itself using proteolytic enzymes or chemical agents and 
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then determining the ability of each fragment to bind to antibodies or 
to provoke an immune response when inoculated into an animal or 
suitable in vitro model system (Strohmaier §1, J rQen, Vj,ro3, . , 1982, 
52, 205-306). Alternatively, the DNA encoding the polypeptide may be 
fragmented by restriction enzyme digestion or other well-known 
techniques and then introduced into an expression system to produce 
fragments (optionally fused to a polypeptide usually of bacterial 
origin) . The resulting fragments are assessed as described previously 
(Spence ^ &1. J. Gen, Virol . . 1989, 70, 2843-51; Smith et si. 
1984, Zlf 263-9). Another approach is to synthesise chemically short 
peptide fragments (3-20 amino acids long; conventionally 6 amino acids 
long) which cover the entire sequence of the full-length polypeptide 
with each peptide overlapping the adjacent peptide. (This overlap can 
be from 1-10 amino acids but ideally is n-1 amino acids where n is the 
length of the peptide; Geysen et al, Proc. Natl, Acad, Sci. , 1984, 81, 
3998-4002), Each peptide is then assessed as described previously 
except that the peptide is usually first coupled to some carrier 
molecule to facilitate the induction of an immune response. Finally, 
there are predictive methods which involve analysis of the sequence 
for particular features, e.g. hydrophilicity , thought to be associated 
with immunologically important sites (Hopp and Woods, groc. — Natl. 
Acad, Sci .. 1981, 28, 3824-8; Berzofsky. Science . 1985, 229, 932-40). 
These predictions may then be tested using the recombinant polypeptide 
or peptide approaches described previously. 

Preferably, the viral polypeptide is provided in a pure form, i.e. 
greater than 90% or even 95% purity. 

The FT-NANBH viral polypeptide of the present invention may be 
obtained using an amino acid synthesiser, if it is an antigen having 
no more than about thirty residues, or by recombinant DNA technology. 

The present invention also provides a DNA sequence encoding a PT-NANBH 
viral polypeptide as herein defined. 



MJS/AC/12th December 1990 



5 . PA1121 



The DNA sequence of the present invention may be synthetic or cloned. 
Preferably, the DNA sequence is as set forth in SEQ ID NO : 3,4,5,18, 
19,20,21 or 22. 

To obtain a PT-NANBH viral polypeptide comprising multiple antigens, 
it is preferred to fuse the Individual coding sequences into a single 
open reading frame. The fusion should of course be carried out in 
such a manner that the antigenic activity of each antigen is not 
significantly compromised by its position relative to another antigen. 
Particular regard should of course be had for the nature of the 
sequences at the actual junction between the antigens. The methods by 
which such single polypeptides can be obtained are well known in the 
art. 

The present invention also provides an expression vector containing a 
DNA sequence, as herein defined, and being capable in an appropriate 
host of expressing the DNA sequence to produce a PT-NANBH viral 
polypeptide. 

The expression vector normally contains control elements of DNA that 
effect expression of the DNA sequence in an appropriate host. These 
elements may vary according to the host but usually include a 
promoter, ribosome binding site.translational start and stop sites, 
and a transcriptional termination site. Examples of such vectors 
include plasraids and viruses. Expression vectors of the present 
invention encompass both extrachromosomal vectors and vectors that are 
integrated into the host cell's chromosome. For use in E.poli , the 
expression vector may contain the DNA sequence of the present 
invention optionally as a fusion linked to either the 5'- or 3 '-end of 
the DNA sequence encoding, for example, /3-galactosidase or to the 3'- 
end of the DNA sequence encoding, for example, the trp E gene. For 
use in the insect baculovirus (AcNPV) system, the DNA sequence is 
optionally fused to the polyhedrin coding sequence. 
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The present invention also provides a host cell transformed with an 
expression vector as herein defined. 

Examples of host cells of use with the present invention include 
prokaryotic and eukaryotic cells, such as bacterial, yeast, mammalian 
and insect cells. Particular examples of such cells are E.coll . 
S.cerevisiae . p.pastorls . Chinese hamster ovary and moxise cells, and 
Spodoptera frup:ip6rda and Tricoplusia pi. The choice of host cell may 
depend on a number of factors but, if post-translational modification 
of the PT-NANBH viral polypeptide is important, then an eukaryotic 
host would be preferred. 

The present invention also provides a process for preparing PT-NANBH 
viral polypeptide which comprises cloning or synthesising a DNA 
sequence encoding PT-NANBH viral polypeptide, as herein defined, 
inserting the DNA sequence into an expression vector such that it is 
capable in an appropriate host of being expressed, transforming an 
host cell with the expression vector, culturing the transformed host 
cell, and isolating the viral polypeptide. 

The cloning of the DNA sequence may be carried out using standard 
procedures knoxm in the art. However, it is particularly advantageous 
in such procedures to employ the sequence data disclosed herein so as 
to facilitate the identification and isolation of the desired cloned 
DNA sequences. Preferably, the RNA is isolated by pelleting the virus 
from plasma of infected humans identified by implication in the 
transmission of PT-NANBH, The isolated RNA is reverse transcribed 
into cDNA using either random or oligo-dT priming. Optionally, the 
RNA may be subjected to a pre-treatment step to remove any secondary- 
structure which may interfere with cDNA synthesis, for example, by 
heating or reaction with methyl mercuric hydroxide. The cDNA is 
usually modified by addition of linkers followed by digestion with a 
restriction enzyme. It is then inserted into a cloning vector, such 
as pBR322 or a derivative thereof or the lambda vectors gtlO and gtll 
(Huynh et ^, DNA Cloning . 1985, Vol 1: A Pr^^ctical Appr oach, Oxford, 
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IRC Press) packaged into virions as appropriate, and the resulting 
recombinant DMA molecules used to transform E.coli and thus generate 
the desired library. 

The library may be screened using a standard screening strategy. If 
the library is an expression library, it may be screened using an 
immunological method with antisera obtained from the same plasma 
source as the RNA starting material and also with antisera from 
additional human sources expected to be positive for antibodies 
against PT-NANBH. Since human antisera usually contains antibodies 
against E.coli which may give rise to high background during 
screening, it is preferable first to treat the antisera with 
untransformed E>coli lysate so as to remove any such antibodies. It 
is advantageous to employ a negative control using antisera from 
accredited human donors, i.e. human donors who have been repeatedly 
tested and found not to have antibodies against viral hepatitis . An 
alternative screening strategy would be to employ as hybridisation 
probes one or more labelled oligonucleotides. The use of 
oligonucleotides in screening a cDNA library is generally simpler and 
more reliable than screening with antisera. The oligonucleotides are 
preferably synthesised using the DNA sequence information disclosed 
herein. One or more additional rounds of screening of one kind or 
another may be carried out to characterise and identify positive 
clones , 

Having identified a first positive clone, the library may be 
rescreened for additional positive clones using the first clone as an 
hybridization probe. Alternatively or additionally, further libraries 
may be prepared and these may be screened using immunoscreens or 
hybridisation probes. In this way» further DNA sequences may be 
obtained. 

Alternatively, the DNA sequence encoding PT-NANBH viral polypeptide 
may be synthesised using standard procedures and this may be preferred 
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to cloning the DNA in some circumstances (Gait, piigonucj-eot^de 
Synthesis: A Practical Approach . 1984, Oxford, IRL Press), 

Thus cloned or synthesised, the desired DNA sequence may be inserted 
into an expression vector using known and standard techniques. The 
expression vector is normally cut using restriction enzymes and the 
DNA sequence inserted using blunt-end or staggered-end ligation. The 
cut is usually made at a restriction site in a convenient position in 
the expression vector such that, once inserted, the DNA sequence is 
under the control of the functional elements of DNA that effect its 
expression. 

Transformation of an host cell may be carried out using standard 
techniques. Some phenotypic marker is usually employed to distinguish 
between the transformants that have successfully taken up the 
expression vector and those that have not. Culturing of the 
transformed host cell and isolation of the PT-NANBH viral polypeptide 
may also be carried out using standard techniques. 

Antibody specific to a PT-NANBH viral polypeptide of the present 
invention can be raised using the polypeptide. The antibody may be 
polyclonal or monoclonal. The antibody may be used in quality control 
testing of batches of PT-NANBH viral polypeptide; purification of a 
PT-NANBH viral polypeptide or viral lysate; epitope mapping; when 
labelled, as a conjugate in a competitive type assay, for antibody 
detection; and in antigen detection assays. 

Polyclonal antibody against a PT-NANBH viral polypeptide of the 
present invention may be obtained by injecting a PT-NANBH viral 
polypeptide, optionally coupled to a carrier to promote an immune 
response, into a mammalian host, such as a mouse, rat, sheep or 
rabbit, and recovering the antibody thus produced. The PT-NANBH viral 
polypeptide is generally administered in the form of an injectable 
formulation in which the polypeptide is admixed with a physiologically 
acceptable diluent. Adjuvants, such as Preund's complete adjuvant 
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(FCA) or Freund's incomplete adjuvant (FIA) , may be included in the 
formulation. The formulation is normally injected into the host over 
a suitable period of time, plasma samples being taken at appropriate 
intervals for assay for anti-PT-NANBH viral antibody , When an 
appropriate level of activity is obtained, the host is bled. Antibody 
is then extracted and purified from the blood plasma using standard 
procedures, for example, by protein A or ion-exchange chromatography. 

Monoclonal antibody against a PT-NANBH viral polypeptide of the 
present invention may be obtained by fusing cells of an immortalising 
cell line with cells which produce antibody against the viral 
polypeptide, and culturing the fused immortalised cell line. 
Typically, a non-human mammalian host, such as a mouse or rat, is 
inoculated with the viral polypeptide. After sufficient time has 
elapsed for the host to mount an antibody response, antibody producing 
cells, such as the splenocytes, are removed. Cells of an 
immortalising cell line, such as a mouse or rat myeloma cell line, are 
fused with the antibody producing cells and the resulting fusions 
screened to identify a cell line, such as a hybridoma, that secretes 
the desired monoclonal antibody. The fused cell line may be cultured 
and the monoclonal antibody purified from the culture media in a 
similar manner to the purification of polyclonal antibody. 

Diagnostic assays based upon the present invention may be used to 
determine the presence or absence of PT-NANBH infection. They may 
also be used to monitor treatment of such infection, for example in 
interferon therapy. 

In an assay for the diagnosis of viral infection, there are basically 
three distinct approaches that can be adopted involving the detection 
of viral nucleic acid, viral antigen or viral antibody. Viral nucleic 
acid is generally regarded as the best indicator of the presence of 
the virus itself and would identify materials likely to be infectious. 
However, the detection of nucleic acid is not usually as 
straightforward as the detection of antigens or antibodies since the 
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level of target can be very low. Viral antigen is used as a marker 
for the presence of virus and as an indicator of infectivity. 
Depending upon the virus, the amount of antigen present in a sample 
can be very low and difficult to detect. Antibody detection is 
relatively straightforward because, in effect, the host immune system 
is amplifying the response to an infection by producing large amounts 
of circulating antibody. The nature of the antibody response can 
often be clinically useful, for example IgU rather than IgG class 
antibodies are Indicative of a recent infection, or the response to a 
particular viral antigen may be associated with clearance of the 
virus. Thus the exact approach adopted for the diagnosis of a viral 
infection depends upon the particular circumstances and the 
information sought. In the case of PT-NANBH. a diagnostic assay may 
embody any one of these three approaches. 

In an assay for the diagnosis of PT-NANBH involving detection of viral 
nucleic acid, the method may comprise hybridising viral RNA present in 
a test sample, or cDNA synthesised from such viral RNA. with a DNA 
sequence corresponding to the nucleotide sequence of SEQ ID NO : 
3,4,5,18,19,20,21 or 22 and screening the resulting nucleic acid 
hybrids to identify any PT-NANBH viral nucleic acid. The application 
of this method is usually restricted to a test sample of an 
appropriate tissue, such as a liver biopsy, in which the viral RNA is 
likely to be present at a high level. The DNA sequence corresponding 
to the nucleotide sequence of SEQ ID NO : 3,4,5,18.19,20,21 or 22 may 
take the form of an oligonucleotide or a cDNA sequence optionally 
contained within a plasmid. Screening of the nucleic acid hybrids is 
preferably carried out by using a labelled DNA sequence. One or more 
additional rounds of screening of one kind or another may be carried 
out to characterise further the hybrids and thus identify any PT-NANBH 
viral nucleic acid. The steps of hybridisation and screening are 
carried out in accordance with procedures known in the art. 

Because of the limited application of this method in assaying for 
viral nucleic acid, a preferred and more convenient method comprises 
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synthesising cDNA from viral RNA present in a test sample, amplifying 
a preselected DNA sequence corresponding to a subsequence of the 
nucleotide sequence of SEQ ID NO : 3,4,5,18,19.20,21 or 22. and 
identifying the preselected DNA sequence. The test sample may be of 
any appropriate tissue or physiological fluid and is preferably 
concentrated for any viral RNA present. Examples of an appropriate 
tissue include a liver biopsy. Examples of an appropriate 
physiological fluid include urine, plasma, blood, serum, semen, tears, 
saliva or cerebrospinal fluid. Preferred examples are serum and 
plasma. 

Synthesis of the, cDNA is normally carried out by primed reverse 
transcription using random, defined or oligo-dT primers. 
Advantageously, the primer is an oligonucleotide corresponding to the 
nucleotide sequence of SEQ ID NO : 3,4,5,18,19,20,21 or 22 and 
designed to enrich for cDNA containing the preselected sequence. 

Amplification of the preselected DNA sequence is preferably carried 
out using the polymerase chain reaction (PGR) technique (Saiki et §1, 
Science . 1985, 220. 1350-4), In this technique, a pair of 
oligonucleotide primers is employed one of which corresponds to a 
portion of the nucleotide sequence of SEQ ID NO : 3,4,5,18,19,20,21 or 
22 and the other of which is located to the 3' side of the first and 
corresponds to a portion of the complementary sequence, the pair 
defining between them the preselected DNA sequence. The 
oligonucleotides are usually at least 15, optimally 20 to 26, bases 
long and, although a few mismatches can be tolerated by varying the 
reaction conditions, the 3 '-end of the oligonucleotides should be 
perfectly complementary so as to prime effectively. The distance 
between the 3' -ends of the oligonucleotides may be from about 100 to 
about 2000 bases. Conveniently, one of the pair of oligonucleotides 
that is used in this technique is also used to prime cDNA synthesis. 
The PCR technique itself is carried out on the cDNA in single stranded 
form using an enzyme, such as Taq polymerase, and an excess of the 
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oligonucleotide primers over 20-40 cycles in accordance with published 
protocols (Saiki ^ £l, Science . 1988. lli, 487-491), 

As a refinement of the technique, there may be several rounds of 
amplification, each round being primed by a different pair of 
oligonucleotides. Thus, after the first round of amplification, an 
internal pair of oligonucleotides defining a shorter DNA sequence (of, 
say, from 50 to 500 bases long) may be used for a second round of 
amplification. In this somewhat more reliable refinement, referred to 
as 'Nested PGR' . it is of course the final amplified DNA sequence that 
constitutes the preselected sequence. (Kemp et al, froc, N^tl,, — Acad, 

Sci. . 1989. MID,. 2423-7 and Mullis et al. Methods i,n EnzymolofiYi 

1987, lis, 335-350). 

Identification of the preselected DNA sequence may be carried out by 
analysis of the PGR products on an agarose gel. The presence of a 
band at the molecular weight calculated for the preselected sequence 
is a positive indicator of viral nucleic acid in the test sample. 
Alternative methods of identification include those based on Southern 
blotting, dot blotting, oligomer restriction and DNA sequencing. 

The present invention also provides a test kit for the detection of 
PT-NANBH viral nucleic acid, which comprises 

i) a pair of oligonucleotide primers one of which corresponds to a 
portion of the nucleotide sequence of SEQ ID NO : 
3,4.5,18,19.20.21 or 22 and the other of which is located to the 
3' side of the first and corresponds to a portion of the 
complementary sequence, the pair defining between them a 
preselected DNA sequence; 

ii) a reverse transcriptase enzyme for the synthesis of cDNA from 
test sample RNA upstream of the primer corresponding to the 
complementary nucleotide sequence of SEQ ID NO : 
3,4.5.18.19,20.21 or 22; 
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ili) an enzyme capable of amplifying the preselected DNA sequence; and 
optionally; 

Iv) washing solutions and reaction buffers. 

Advantageously, the test kit also contains a positive control sample 
to facilitate in the identification of viral nucleic acid. 

The characteristics of the primers and the enzymes are preferably as 
described above in connection with the PGR technique. 

In an assay for the diagnosis of PT-NANBH involving detection of viral 
antigen or viral antibody, the method may comprise contacting a test 
sample with a PT-NANBH viral polypeptide of the present invention, or 
polyclonal or monoclonal antibody against the polypeptide, and 
determining whether there is any antigen- antibody binding contained 
within the test sample. For this purpose, a test kit may be provided 
comprising a PT-NANBH viral polypeptide, as defined herein, or a 
monoclonal or polyclonal antibody thereto, and means for determining 
whether there is any binding with antibody or antigen respectively 
contained in the test sample. The test sample may be taken from any of 
the appropriate tissues and physiological fluids mentioned above for 
the detection of viral nucleic acid. If a physiological fluid is 
obtained, it may optionally be concentrated for any viral antigen or 
antibody present, 

A variety of assay formats may be employed. The PT-NANBH viral 
polypeptide can be used to capture selectively antibody against 
PT-NANBH from solution, to label selectively the antibody already 
captured, or both to capture and label the antibody. In addition, the 
viral polypeptide may be used in a variety of homogeneous assay 
formats in which the antibody reactive with the antigen is detected in 
solution with no separation of phases. 
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The types of assay In which the PT-NANBH viral polypeptide is used to 
captxire antibody from solution involve Inmobilization of the 
polypeptide onto a solid surface. This surface should be capable of 
being vashed in some way. Examples of suitable surfaces include 
polymers of various types (moulded into miorotitre wells; beads; 
dipsticks of various typos; aspiration tips; electrodes; and optical 
devices), particles (for example latex; stabilized red blood cells; 
bacterial or fungal cells; spores; gold or other metallic or 
metal- containing sols; and protelnaceoua colloids) with the usual size 
of the particle being from 0,02 to 5 microns, membranes (for example 
of nitrocellulose; paper; cellulose acetate; and high porosity/high 
surface area membranes of an organic or inorganic material) . 

The attachment of the PT-NANBH viral polypeptide to the- surface can be 
by passive adsorption from a solution of optimum composition which may 
include surfactants, solvents, salts and/or chaotropes; or by active 
chemical bonding. Active bonding may be through a variety of reactive 
or activatible functional groups which may be exposed on the surface 
(for example condensing agents; active acid esters, halldes and 
anhydrides; amino, hydroxyl, or carboxyl groups; sulphydryl groups; 
carbonyl groups; dlazo groups; or unsaturated groups) . Optionally, 
the active bonding may be through a protein (itself attached to the 
surface passively or through active bonding) , such as albumin or 
casein, to which the viral polypeptide may be chemically bonded by any 
of a variety of methods. The use of a protein in this way may confer 
advantages because of isoelectric point, charge, hydrophllicity or 
other physico-chemical property. The viral polypeptide may also be 
attached to the surface (usually but not necessarily a membrane) 
following electrophoretlc separation of a reaction mixture, such as 
immune precipitation. 

After contacting (reacting) the surface bearing the PT-NANBH viral 
polypeptide with a test sample, allowing tlae for reaction, and, where 
necessary, removing the excess of the sample by any of a variety of 
means, (such as washing, centrifugatlon, filtration, magnetism or 
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capilliary action) the captured antibody is detected by any means 
which will give a detectable signal. For example, this may be 
achieved by use of a labelled molecule or particle as described above 
which will react with the captured antibody (for example protein A or 
protein G and the like: anti-species or anti^iiwmmo globulin- 
sub- type; rheumatoid factor; or antibody to the antigen, used in a 
competitive or blocking fashion) , or any molecule containing an 
epitope contained in the polypeptide. 

The detectable signal may be optical or radioactive or physico- 
chemical and may be provided directly by labelling the molecule or 
particle with, for example, a dye, radiolabel, electroactive species, 
magnetically resonant species or fluorophore, or indirectly by 
labelling Che molecule or particle with an enzyme itself capable of 
giving rise to a measurable change of any sort. Alternatively the 
detectable signal may be obtained using, for example, agglutination, 
or through a diffraction or birefringent effect if the surface is in 
the form of particles. 

Assays in which a PT-NANBH viral polypeptide itself is used to label 
an already captured antibody require some form of labelling of the 
antigen which will allow it to be detected. The labelling may be 
direct by chemically or passively attaching for example a radio label, 
magnetic resonant species, particle or enzyme label to the 
polypeptide; or indirect by attaching any form of label to a molecule 
which will itself react with the polypeptide. The chemistry of bonding 
a label to the PT-RASBH viral polypeptide can be directly through a 
moiety already present in the polypeptide, such as an amino group, or 
through an intermediate moiety, such as a maleimide group. Capture of 
the antibody may be on any of the surfaces already mentioned by any 
reagent including passive or activated adsorption which will result in 
specific antibody or immune complexes being bound. In particular, 
capture of the antibody could be by anti-species or 
anti-immunoglobulin-sub-type, by rheumatoid factor, proteins A. G and 
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the like, or by any molecule containing an epitope contained in the 
polypeptide, 

The labelled PT-NANBH polypeptide may be ixsed in a coxspetitive binding 
fashion in which its binding to any specific molecule on any of the 
svtrfaces exemplified above is blocked by antigen in the sample. 
Alternatively, it may be used in a non-competitive fashion in which 
antigen in the sample is bound specifically or non- specifically to any 
of the surfaces above and is also bovxid to a specific bi- or 
poly-valent molecule (e.g. an antibody) with the remaining valencies 
being used to capture the labelled polypeptide. 

Often in homogeneous assays the PT-NANBH viral polypeptide and an 
antibody are separately labelled so that, when the antibody reacts 
with the viral polypeptide in free solution, the two labels interact 
to allow, for exainple, non-radiative transfer of energy captured by 
one label to the other label with appropriate detection of the excited 
second label or quenched first label (e,g, by fluorimetry, magnetic 
resonance or enzyme measurement) • Addition of either viral 
polypeptide or antibody in a sample results in restriction of the 
interaction of the labelled pair and thus in a different level of 
signal in the detector. 

A suitable assay format for detecting PT-NANBH antibody is the direct 
sandwich enzyme immunoassay (EIA) format. A PT-NAKBH viral 
polypeptide is coated onto microtitre wells, A test sample and a 
PT-NA2?BH viral polypeptide to which an ensyme is coupled are added 
simultaneously. Any PT-NANBH antibody present iti the test sample 
binds both to the viral polypeptide coating the well and to the 
enzyme -coupled viral polypeptide. Typically, the same viral 
polypeptide is used on both sides of the sandwich. After washing, 
bound enzyme is detected using a specific substrate involving a colour 
change. A test kit for use in such an EIA comprises: 

(1) a PT-NANBH viral polypeptide labelled with an enzyme; 
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(2) a substrate for the enzyme; 

(3) meatis providing a surface on which a PT-NANBH viral polypeptide 
is imobilised; and 

(4) optionally, washing solutions and/or buffers. 

The viral polypeptides of the present invention may be incorporated 
into a vaccine fonaulation for inducing immunity to PT-NAKBH in man. 
For this purpose the viral polypeptide may be presented in association 
with a pharmaceutically acceptable carrier. 

For use in a vaccine formlation. the viral polypeptide may optionally 
be presented as part of an hepatitis B core fusion particle, as 
described in Clarke ^ al (Nature . 1987, 130, 381-384), or a 
polylysine based polymer, as described in Tam (PNAS, 1988 , 85, 
5409-5413), Alternatively, the . viral polypeptide may optionally be 
attached to a particulate structure, such as liposomes or ISCOKS* 

Pharmaceutically acceptable carriers include liquid media suitable for 
iise as vehicles to introduce the viral polypeptide into a patient. An 
example of such liquid media is saline solution. The viral 
polypeptide itself may be dissolved or suspended as a solid in the 
carrier. 

The vaccine formulation may also contain an adjuvant for stimulating 
the immune response and thereby enhancing the effect of the vaccine. 
Examples of adjuvants include aluminium hydroxide and aluminium 
phosphate . 

The vaccine formulation may contain a final concentration of viral 
polypeptide in the range from 0.01 to 5 mg/ml, preferably from 0.03 to 
2 mg/ml. The vaccine formulation may be incorporated into a sterile 
container, which is then sealed and stored at a low temperature, for 
example 4^C, or may be freeze -dried. 
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In order to induce inimunity in man to PT-NANBH. one or more doses of 
the vaccine formulation may be administered. Each dose may be 0.1 to 
2 ml, preferably 0.2 to 1 ml, A method for inducing immunity to 
PT-RAHBH in man, comprises the administration of an effective amount 
of a vaccine formulation, as hereinbefore defined. 

Ifae present invention also provides the use of a PT-NANBH viral 
polypeptide in the preparation of a vaccine for use in the induction 
of immunity to PT-KANBH in man. 

Vaccines of the present invention may be administered by any 
convenient method for the administration of vaccines including oral 
and parenteral (e.g. intravenous, subcutaneous or intramuscular) 
injection. The treatment may consist of a single dose of vaccine or a 
plurality of doses over a period of time. 

The following transformed strains of &^£2li were deposited with the 
National Collection of Type Cultures (NCTC) . Central Public Health 
Laboratory. 61, Colindale Avenue, London, IIW9 5HT on the indicated 
dates : 

i) E. coli TGI transformed by pDX113 (WDOOl) ; Deposit Ho. NCTC 
12369; 7th December 1989 

il) P-coli TGI transformed by pDX128 (WD002); Deposit No. NCTC 12382; 
23rd February 1990. 

ill) p. coli TGI transformed by pl36/155 CWD003); Deposit No. NCTC 
28th November 1990. 

iv) E.coli T61 transformed by pl56/92 (WD004); Deposit No. NCTC ; 
28th November 1990. 

V) E.coli TGI transformed by pl29/164 (WD005); Deposit No. NCTC 
28th November 1990. 



HIS/AC/12th December 1990 



19 . PA1121 



vl) iLcsli TGI transformed by pDX136 (WD006) ; Deposit Ko. NCTC ! 
28th November 1990. 

In the Figures, Figure 1 shows a representation of the production of 
PDX122 described in Example 7, Figure 2 shows a representation of the 
production of two alternative fused sequences described in Example 17. 
and Figure 3 shows restriction maps of SEQ ID NO : 21 and 22. 

in the Sequence Listing, there are listed SEQ ID NO : 1 to 25 to which 
references are made in the description and claims. 

The following Exainples serve to illustrate the invention. 
pt AMPLE 1 , Synthesis of cDNA 

Pooled plasma (160 mis) from two individuals (referred to as A and L) 
known to have transmitted NAHBH via transfusions was diluted (1:2.5) 
with phosphate buffered saline (PBS) and then centrifuged at 190.000s 
(e g 30,000rpm in an MSE 8x50 rotor) for 5hrs at 4'C. The supernatant 
was retained as a source of specific antibodies for subsequent 
screening of the cDNA libraries. The pellet was resuspended in 2mls 
of 20mM tris-hydrochloride. 2mM EDiA 3% SDS, 0.2M NaCl (2xPK) 
extracted 3 times with an equal volume of phenol, 3 times with 
chloroform, once with ether, and then precipitated with 2.5 volumes of 
ethanol at .20'C. The precipitate was resuspended in mi of lOmM 
tr is -hydrochloride, ImM EDTA at pH 8.0 (TE), 

The nucleic acid was used as a template in a cDNA synthesis Mt 
(Amersham International pic, Amersham. U.K.) with both oligo-dT and 
random hexanucleotide priming. The reaction conditions were as 
recommended by the kit supplier. Specifically, lul of the nucleic 
acid was used for a first strand synthesis reaction which was labelled 
with [a-^h]6CT? (Amersham; specific activity 3000Ci/mmol) in a final 
volume of 20ul and incubated at A2-C for 1 hour. The entire first 
strand reaction was then used for second strand synthesis reaction. 
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containing S. fisOi RNaseH (0.8 U) and DNA polymerase 1 (23 U) in a 
final volume of lOOul, incubated at 12*C for 60 minutes then 22'C for 
60 minutes. The entire reaction was then incubated at 70«C for 10 
minutes, placed on ice, 1 13 of T4 DNA polymerase vas added and then 
incubated at 37'C for 10 minutes. The reaction was stopped by 
addition of 5ul of 0.2M EDTA pH8. 

Unincorporated nucleotides were removed by passing the reaction over a 
KICK column (Pharmacia Ltd. Milton Keynes, U.K.) The cDNA was than 
extracted twice with phenol, three times with chloroform, once with 
ether and then 20 (tg dextran was added before precipitation with 2.5 
volumes of 100% ethanol. 

^q^PLE 2. Production of Ex pysgsion Libraries 

The dried cDKA pellet was resuspended in 5ul of sterile TE and then 
incubated with SOOng of EcoRI linkers (Pharmacia; GGAATTCC 
phosphorylated) and 0.5 U of T4 DNA ligase (New England BioLabs. 
Beverley, MA, USA) in final volume of 10^1 containing 20mM Tris-HCl 
. pH7.5, lOmM MgClj,, lOraM DTT, InH ATP for 3 hours at 15«C. The ligase 
was inactivated by heating to 65*0 for 10 minutes and the cDNA was 
digested with 180U of EcoRI (BCL, Lewes, U.K.) in a final volume of 
100;«1 at 37*C for 1 hour. EDTA .was added to a final concentration of 
lOmM and the entire reaction loaded onto an AcA34 (LKB) column. 
Fractions (50/il) were collected and counted. The peak of cDRA in the 
excluded volume (980 cpm) was pooled, extracted twice with phenol, 
three times with chloroform, once with ether and then ethanol 
precipitated. 

The ds cDRA was resuspended in 5/il TE and ligated onto lambda gtll 
EcoRI arms (Gibco, Paisley, Scotland) in a 10^*1 reaction containing 
0.5U T4 DNA ligase, 66 mM tris-hydrochloride. lOmH MgClj, ISroM DTT pH 
7.6 at 15«C overnight. After inactivating the ligase by heating to 
65'C. for 10 minutes. 5ul of the reaction were added to an Amersham 
packaging reaction and incubated at 22»C for 2 hours. The packaged 
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material was titrated on £. sslL strain Y1090 (Huynh s£ H 1985) and 



4 

contained a total of 2.6x10 recombinants, 



Plating cells (Y1090) were prepared by inoculating 10 mis L- broth 
with a single colony from an agar plate and shaking overnight at 37'C. 
The next day O.Smls of the overnight culture were diluted with lOmls 
of fresh L-broth and O.lal IM MgSO^ and 0.1ml 20%(w/v) maltose were 
added. The culture was shaken for 2 hours at 37»C, the bacteria 
harvested by centrifugatton at S.OOOg for 10 minutes and resuspended 
in 5 mis lOaiM MgSO^ to produce the plating cell stock. A portion 
(lul) of the packed material was mixed wich 0.2ml of plating cells, 
iticubated at 37'C for 20 minutes before 3 mis of top agar were added 
and the entire mixture poured onto a 90mm L-agar plate. After 
overnight incubation at 37-C plaques were counted and the total number 
of recombinant phage determined. The remaining packaged material 
(500ul) was stored at WO, 

Additional libraries were prepared in a substantially similar manner. 



RYAMT >T.E 3. yrraenine --.f p.vipresslon Llbrarteq 

The initial library described in Example 2 was plated out onto £. caU 
strain Y1090 at a density of about 5x10^ pfu per lUOm. plate and grown 
at 37 'C for 2 hours until the plaques were visible. Sterile 
nitrocellulose filters which had been impregnated with IPTG 
(isopropylthiogalactoside) were left in contact with the plate for 3 
hours and then removed. The filters were first blocked by incubation 
with blocking solution [3%(w/v)BSA/TBS.TweenC10nM Tris-HCl pH8. ISOmH 
HaCl. 0.05%(v/v) Tween 20) containing 0.05% bronidox] (20mls/filter) 
and then transferred to binding buffer ll%(w/v)BSA/TBS/Tween 
containing 0,05% bronidox] containing purified <by ion-exchange 
chromatography) antibodies from pooled A & L plasma (20,ig/ml) . After 
incubation at room temperature for 2 hours the filters were washed 
three times with TBS-Tween and then incubated in binding buffer 
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containing biotinylated sheep anti -human (1:250). After 1 hour at 
room temperature the filters were washed 3 times with TBS/Tween and 
then incubated in binding buffer containing streptavidin/peroxidase 
complex (1:100). The signal developed with DAB. Positive signals 
appeared as (coloured) plaques. 

Out of a total of 2.6 x 10* plaques screened. 8 positives were 
obtained on the first round screen. Using the filters as a template, 
the regions of the original plates corresponding to these positive 
signals were picked effusing a sterile pasteur pipette. The agar 
plugs were suspended in 0.1 ml of SM buffer and the phage allowed to 
diffuse out. The titre of phage from each plug was determined on £. 
coll strain Y1090. The phage stock from each plug was then 
re-screened as before on individual 90mm plates at a density of about 
1 X 10^ pfu per plate. Of 8 first round positives, one was clearly ■ 
positive on the second roimd. i.e. >1% of plaques positive, this was 
called JG2. This corresponds to a positive rate of 40/10 In the 
library . 

This and other positive phage identified in an similar way from other 
cDNA libraries described in Example 2 were then purified by repeated 
rounds of plaque screening at lower density (1-200 pfu/90m plate) 
until 100% of the plaques were positive with the A&L antibody screen. 
Three such recombinant phage were JGl, JG2 and JG3. 

c y.MPT.R 4. Se < -»r.^.rv S,>.r.«T.tTr "^"^ "^'^ ""^^^ '^'^^ 

Each of the recombinant phage, JCl. JG2 and JG3. were plaque purified 
and stored as titred stocks in SM buf f er at 4-C. These phage were 
mixed (1:1) with a stock of phage identified as negative in Example 3 
and mixture used to infect £. coH strain Y1090 at 1000 pfu per plate. 
Plaque lifts were taken and processed as described in Example 3 except 
that the filters were cut into quadrants and each quadrant was 
incubated with a different antibody; these were A&L antibodies 
(20p6/ml): A plasma (1:500); L plasma (1:500) and H IgG (20Mg/ml). H 
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is a patient expected to be positive for PT-NANBH antibodies because 
he was a haemophiliac who had received non -heat- treated Factor VIII. 
At the end of the reaction each filter was scored blind as positive 
(when there were clearly two classes of signal) or negative (when all 
plaques gave the sane signal) . This could be a subjective judgement 
and so the scores were compared and only those filters where there was 
a najority agreement were taken as positive. The results 
presented in Table 1. 



are 



H 



A&L A L 

JGl + + - 

JG2 + + + + 

JG3 + + + + 

JGl appeared only to react with antibodies from patient A and not L or 
H; this is not what would be expected of a true PT-NANBH related 
recombinant polypeptide and so JGl was dropped from the analysis. 
However both ,JG2 and JG3 gave clear positive reactions with three 
PT-NAHBH sera A, L and H; these were analysed further. 

The type of analysis described above was repeated for JG2 and JG3 
except that the filters were cut into smaller portions and these were 
incubated with panels of positive and negative sera. The panels of 
positive sera comprised one panel of 10 haemophiliac sera and one 
panel of 9 intravenous drug addict (IVDA) sera. These represented the 
best source of positive sera even though the actual positive rate was 
unknown. The panel of negative sera was obtained from accredited 
donors who have been closely monitored over many years by the North 
London Blood Transfusion Centre. Deansbrook Road. Edgware. Middlesex. 
U.K. and have never- shown any sign of infection with a variety of 
agents including FT - NANBH. The results are presented in Tables 
2 & 3. 
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I.D. JG2 J63 

ITOAs V19146 it/i 0/5 

V27083 2/4 . 0/5 

V29779 0/4 0/5 

V12561 0/5 &a 

V15444 1^ 

V18342 &Jk 0/5 

V8403 0/5 

V20001 4/4 0/5 

V21213 m. 

Haemophiliacs M1582 ^ 

M158I 5Zi 

M1575 m 0/5 

111579 ^ 

H1585 aZS 0/5 

M1576 1/5 1/5 

M1580 1/5 0/5 

M1578 1/5 0/5 

M1587 1/5 

M1577 2/5 1/5 

Positives are underlined. 

Tf BLE 3 

Accredited 

IVDA Haemophiliac Donor 

JG2 6/9(66%) 5/10(50%) 0/10(0%) 

JG3 2/9(22%) . 4/10(40%) 0/10(0%) 

JG2-hJG3 1/9(11%) 3/10(30%) 0/10(0%) 

JG2 or JG3 7/9(77%) 6/10(60%) OA0(0%) 
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These data are consistent vith the hypothesis that both reooinblnants 
are expressing polypeptides associated with an agent responsible for 
PT-NANBH and that these polypeptides are not identical but n.ay share 
some antigenic sites . 

A portion (10^1) of the phage stocks for both JG2 and JG3 was boiled 
to denature the phage and expose the DM. This DNA was then used aa a 
template in a PCR amplification using Taq polymerase; each reaction 
cont:ained the following in a final volume of SOul:- lOnM Tris-HCl. 
50m KCl. l.Sm MgCl^. 0.01% gelatin. pH 8.3 at 25'C plus 
oligonucleotide primers dl9 and d20 (SEQ ID NO : 1 and 2 respectively^ 
200ngeach); these primers are located in the lambda sequences 
flanking the Eco RI cloning site and therefore prime the amplif Icatxon 
of anything cloned into this site. 

A portion of the reaction was analysed on a 1.0% agarose gel and 
compared to markers. Amplification of JG2 produced a fragment of 
approximately 2Kb; JG3 one of approximately 1Kb. The remaining 
reaction mix was extracted with phenol/chloriform in the presence of 
lOmM EDTA and 1% SDS and the DM recovered by ethanol precipitation. 
The amplified material was aien digested with 20U of EcoRl for 60 
minutes at 37-C and separated on a 1.0% LGT agarose gel in TAE. The 
fragments were reduced in size as expected and were eluted and 
purified using Elutips (S6.S) . The JG2 and JC3 inserta were ligated 
with EcoRI digested pUCl3 and transformed into £. ££li strain TGI. 
Recombinants were identified as white colonies on X-gal/L-Amp plates 
(L-Agar places supplemented with 100 Mg/ml ampicillin. 0.5 mgAl 
X-gal) and were checked by smaU-scale plasmid preparations and EcoRI 
restriction enzyme digestion to determine the size of the insert DNA. 
The recombinant plasmid containing the JG2 insert was called DM415 and 
that containing the JG3 Insert was called DM416. 
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The sequence of the JG2 insert was determined by direct 
double-stranded sequencing of the plasmld DNA and by subcloning into 
H13 sequencing vectors such as mplS and wpl9 followed by 
single-straiided sequencing. The sequence of the JG3 insert was 
similarly determined. The ' resulting DNA- and deduced aminoacid 
sequences are set forth in SEQ ID NO : 3 and 4. 

FyftM?T.F. 6. acpresslon of ?T-NAHB H Polvneotlde in S,eoli 

The plasmid pDM416 (5ug) was digested with EcoRI (20U) in a final 
volume of 20ul and the 1Kb insert recovered by elution from a 1% LGT 
agarose gel. This material was then "polished" using Klenow fragment 
and a dNTP mix to fill in the EcoRI overhanging ends. The DNA was 
recovered by ethanol precipitation following extraction with 
phenol/chloroform. The blunt- ended fragment was ligated into Smal 
cleaved/phosphatased pDE7107 (a vector which permits cloning at the 3' 
end of lM& Z) and then transformed into I- £sli TGl cells. There was 
a SO-fold increase in colonies over a vector-alone control. 
Tranaformants containing the required recombinant plasmid were 
identified by hybridisation with a radioactive probe produced by PGR 
amplification of the JG3 recombinant. Twelve colonies were analysed 
by restriction enzyme digestion (Sail) of plasmid aini-preparations to 
determine the orientation of the insert. A quarter of these 
recombinants were in the correct orientation to express the PT-NANBH 
sequence as a fusion with ^-galactosidase. One of these (pDX113) was 
taken for further analysis. 

A colony of pDX113 was used to inoculate 50 mis L-broth, grown at 37'C 
with shaking to mid-log phase and expression induced by addition of 
20mM IPTG. After 3 hours the cells were harvested by centrifugatlon 
at 5,000g for 20 minutes, resuspended in 50 mis PBS and repelleted. 
The pelleted cells were resuspended in 5 mis of buffer <25mM Tris-HCl, 
InW EDTA. img/ml lysozyme, 0.2%(vA) Konidet-PAO, pHS.O) per gram of 
pellet and incubated at 0"C for 2 hours. The released bacterial DNA 
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was digested by addition of DNase I and MgSO^ to final concentrations 
of 40ug/ml and 2fflM respectively to reduce viscosity. 

This crude lysate vas analysed by PAGE and the pattern of proteins 
stained with Coomassie blue. A protein of • approximately 150kD was 
induced in bacteria containing pDX113 and this protein was estimated 
to account for 10-15% of the total protein. Similar gels were 
transferred to FVDF inembrane (GRI, Dunmow. Essex. O.K.) and the 
membranes incubated with PT-NANBH-positive and negative sera; the 
150kD protein reacted with the A and L sera but not normal human 
serum. Control traclcs containing lysate from £. SSill expressing 
/S.galaotosldase did not react with A. L or normal human sera. 

Urea was added to the crude lysate to a final concentration of 611 and 
insoluble material removed by centrifugation. The 6H urea extract was 
used to coat microtitre wells directly for 1 hour at 37-C. The wells 
were washed three times with double- distilled water and then blocked 
by additioT^ of 0.25ml of 0.2% BSA per well containing 0.02% NaNj for 
20 minutes at 37'C. The plate was then aspirated. Control plates 
coated with, a crude lysate of a ^.galactosidase-producing L coli 
strain (pm61) were produced in the same way. These plates were used 
in ELISA assays as described in Example 10. 

P^AMPLE 7. f^^^^^i^r. of ^-WAWRH PolvpPT>n-<1p 1n Tp.'^eet Cell s 

The PT-NANBH insert from JG3. isolated as described in Example 5, was 
cloned in-frame with the first 34 nucleotides of polyhedrin in the 
vector pAc360 (Luckow and Summers. liatgsimalasX. X988. 47-55). 
utilising our knowledge of the reading frame of the iacZ gene in the 
gtll vector. Oligonucleotides were synthesised which were able to 
hybridise to gtll sequences flanking the EcoRI cloning site and which 
would enable the amplification of the insert by PGR. These 
oligonucleotides included BamHI restriction sites suitably placed to 
allow direct cloning into the BamHI site of pAc360, placing the 
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Inserted gene in-frame with the amino terminal sequences of 
polyhedrin. 

A small amount of the gtll recombinant JG3 was boiled to expose the 
DBA and then used in a PGR amplification containing the 
oligonucleotide primers d75 and d76 (SEQ ID NO : 6 and 7; 200ing) and 
0.5TJ of Taq polymerase. 

After amplification, the reaction was extracted with an equal volume 
of phenol/chloroform, ethanol precipitated and digested with lOU 
BamHI in a final volume of 30al. The amplified fragment was resolved 
on a 1% agarose gel. eluted and ligated into BamHI-digested pAc360 to 
produce the transfer construct pDXll9. The recombinant plasmld (2ug) 
and wild- type AcRPV DNA (lug) were co-transf acted into insect cells, by 
calcium phosphate precipitation. Inclusion negative recombinant virus 
was selected by visual screening. After three rounds of plaque 
purification, the recombinant virus (BHG-5) was expanded and 
expression of recombinant protein in insect cells was assessed by 
SDS-PAGE. Western blot and EUSA. An abundantly expressed protein of 
approximately 70kD in produced in infected cells. This protein is 
reactive with PT-NAHBH sera by Western blot and EUSA. 

A further baculovirus recombinant (BHC-7) was constructed to include 
JG2 sequences additional to the JG3 sequences present in BHC-5, as 
depicted in Figure 1. The PT-NANBH sequences present in JG2 were 
amplified and cloned into the pAc360 vector as described above to 
produce PDX118 and the appropriate Bam Hl/Sal I fragments of pDX119 
and pDXllS were linked together in that order in pAc360 to produce the 
transfer construct pDX122. 

Recombinant plasmids were identified by hybridisation and orientation 
of Inserted DNA determined by restriction enzyme analysts. 
Recombinant virus was produced as described above and the expressed 
protein analysed by SDS-PAGE. Western blot and ELISA. A very abundant 
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(40% total cell protein) 95kDa polypeptide which reacted with PT-NANBH 
sera was found in infected cells. 

p^ ^.S 8, Purlflgation of DXX13 Polypeptide 

I. £sli strain TGI containing the plasmid pDXll3 (designated strain 
TOLOOl) was grown and induced in a 1.5 litre fermenter (model SET002, 
SGI. Newhaven. East Sussex, U.K.) at 37'G for 5 hours. The cells were 
harvested by centrifugation at 5,000g for 20 minutes and treated as 
follows . 

a) Extraction. 

The wet cells are resuspended (1:20, w/v) in Buffer A (SOnM 
Tris-HCl, 50mM NaCl. ImM EDTA. 5mM DTT, 10%(v/v) glycerol, 
pH8.0). Lysozyme was added at 5ng solid per ml of suspension and 
the mixture left at 4'C. After 15 minutes, the mixture was 
sonicated {^vm pealc-to-peak amplitude) on ice for a total of 3 
minutes (6x 30 sec bursts) . DBase 1 was added at 4ug per ml 
suspension and the mixture left for a further 30 minutes. The 
suspension was centrifuged for 20 minutes at 18,000g(max) and the 
supernatant discarded. 

The pellet was resuspended in buffer B (25mM Hepes, 4M urea. 5aM 
DTT, pH 8,0) at a ratio of 1:6 (w/v) to obtain a fine suspension. 
This was centrifuged at 18,000g(max) for 20 minutes and the 
supernatant discarded. The pellet was resuspended in buffer C 
(25mM Hepes, 8M urea, 2xM DTT, pH 8.0) at a ratio of 1:6 (w/v); 
before suspension the following are added:- leupeptin (lug/ml), 
pepstatin (lug/ml) and E64 (lug/ml). The suspension was 
centrifuged at 18,000g(max) for 30 minutes and the supernatant 
decanted and kept. The pellet was resuspended in 25iaM Hepes , 1% 
SDS pH 8,0, 
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b) ChroBatography. 

The supernatant from the 8M urea fraction was diluted 1:5 (v/v) 
in 25iaM Hepes, 811 urea, 2aM DTT. pH 8.0 and fractionated on a 7ml 
Q-Sepharose column. Proteins were eluted via a salt gradient of 
0-lM NaCl. The chromatography and .data manipulation were 
controlled by an FPLC (Pharmacia) . DX113 elutes at approximately 
SOOoM NaCl and is virtually homogeneous by SDS Page and Western 
blot analysis. 

EXAMPLE 9. Purlflcai -^,nn of BHC-5 Polypeptide 

Sf9 cells (2x10^) were infected with a stock of the BHC-S recombinant 
virus (Boi 5) . After incubation at 28°C for 2 days the cells were 
harvested by oentrifugation and then processed as follows. 

a) Extraction. 

The wet cell mass (1.2g) was resuspended in 6mls of buffer A 
(25nM Hepes, SmM DTT, leupeptin Ipg/ml. pepstatln l;tg/ml, E64 
Ipg/ml pH 8.0). The resuspended cells were placed on ice and 
sonicated for 3 x 15 seconds bursts (6pm peak-to-peak amplitude) 
interspersed with 30 second rest periods. The sonicated 
suspension was centrifuged at 18.000g(max) for 20 minutes and 
the supernatant discarded. The pellet was resuspended in buffer 
A plus AM urea (6mls) and centrifuged at IS.OOOg (max) for 20 
minutes. The supernatant was discarded and the pellet 
re-extracted with buffer A plus 8M urea (6ml). After 
centrifugation at IS.OOOg (max) for 30 minutes the supernatant 
was retained and diluted 1:6 in buff er A plus 8M urea. This 
extract was chromatographed on a mono-Q column equilibrated in 
the same buffer. The column was eluted via a salt gradient 
(0-1. OM NaCl) over 12 column volumes. BHC-5 eluted at 
approximately 0.45 - 0.55m HaCl and was greater than 90% pure as 
judged by SDS-PAGE. The yield, was approximately 70%. 
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RYAMPLE 10. 

^Arfnrmanee » f nvil 3 and ^M f'^ ^"'^ ^ Pol vneptideg »q BMSA 

Microellsa plates (96 well, Hunc) vere directly coated in 50b^ 
bicairbonate buffer (SOfflM sodium bicarbonate and 50nM sodium carbonate . 
titrated to pH 9.5) with either a crude 6M urea lysate of BHC-5 or 
with purified pDXm. Plates were blocked with 0.2% BSA and then 
incubated for 30 minutes at 37-C with sera diluted 1:20 (baculo) or 
1:100 I£. sail). After washing in Tween-saline (0.85% saline, 0.05% 
Tween 20, 0.01% Bronidox) plates were incubated with 
peroxidase-conjugated goat anti-human imunoglobulin (1:2000) for 30 
minutes at 37«C. Plates were then washed in Tween-sallne and colour 
developed by adding the chromogenic substrate TMB (tetr«net:hyl 
benzidine-HCl) (lOO^l/well) and incubating for 20 minutes at room 
temperature. The reaction was stopped with 50^1 2M sulphuric acid and 
the OD450 determined (Table 4;) 



Baculo IjJSSii 

BHC.5 (Solid phase) DX113 (Solid phase) 



Sera from higji risk 
patients positive 
in the assay 



>2 


1.670 


1.855 


1.531 


1.081 


1.015 


1.842 


1.558 


0.526 


0.638 


>2 


1.516 


1.823 


1.602 


1.779 


1.318 


1.122 


0.616 


1.686 


.1.441 
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Sera from high risk 
patients negative 
In the assay 



0.259 


0.205 


0.158 


0.120 


0.298 




0.194 


0.111 


0.282 


0.181 


0.263 


0.165 


0.184 


0.163 


0.121 


0.099 


0.243 


0.104 


0.224 


0.119 



Accredited donor 

Sera from patients at high risk of PT-NANB infection (IVDA's. 
haemophiliacs) were assayed as described; all data are expressed as 
OD450 readings with the accredited donor as a negative control. Of 
this particular group of sera 10/19 are positive on both solid phases. 

Additionally purified DX113 was conjugated to alkaline phosphatase 
xising ShTh/maleij^tdB reduction and an iramunometric assay was 
established. Known NANB positive and negative sera were diluted as 
indicated in accredited donor serum and added to a BHC-7 coated solid 
phase. Either simultaneously or after incubation (30 minutes at 37 ''C) 
theDXUS conjugate was added (50/il. 1:2000). After incubation at 
37 ^'C for 30 minutes, plates were washed with 50m bicarbonate buffer 
and colour developed using the IQ Bio amplification system and the 
OD492 determined (Table 5) 
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atltlbodv 



Assay 

>2 



Assay 

0.217 0'234 



0.821 0-252 

>2 0.214 

0.542 . 0.257 

0.S76 0.308 

1.583 0.278 

>2 0.296 

>2 0-273 

1.830 0.262 

>2 0-251 



Thus with either assay format - antlglobuUn or innnunometric - all the 
high risk samples gave concordant results. 
f^ftl fPT-R 11 - v«f--f-ine Formulation 

A vaccine formulation may he prepared by conventional techniques using 
the following constituents in the indicated amounts: 

PI-NANBH Viral polypeptide > 0.36 mg 

Thiomersal 0.04.0.2 mg 

Sodium Chloride < 8.5 mg 

Water 

« 

. ^.....ri.r. of Mo r-'- n"-' ^^Hh»dinn to VT-mm FolYpept.des , 
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The DBA insert from DH415 was sub- cloned into the baculovirus transfer 
vector p36C and recombinant virus produced by a method essentially 
siMlar to that described in Example 7. The recombinant virus was 
called BHC-l and expressed very low levels of PT-NANBH-specific 
protein. Sf-9 cells (5x10^ cells/ml) infected with BHC-1 were lysed 
in PBS containing 1% (v/v) NP40 and spun at 13000g for 2 minutes. The 
supernatant was passed over Extractigel-D (Pierce Chemicals) to remove 
detergent and then mixed as a 1:1 emulsion with Freund's complete 
adjuvant. Mice were injected subcutaneously with 0.1ml of emulsion 
(equivalent to 5x10^ cells). At 14 and 28 days post- injection, the 
mice were boosted by intraperitoneal injection of 0.1ml (equivalent to 
5x10^ cells) of a detergent-free extract of BHC- 5 -infected Sf-9 cells: 
BHC-5 contains the DNA insert of DM416. Test tail bleeds were taken 
and assayed for anti-PT-NANBH activity in an EUSA (Example 10). Two 
mice with a PT-NANBH-specific response were further boosted by i.v. 
injection with a detergent- free extract of BHC- 7- infected Sf-9 cells; 
BHC.7 contains a DNA insert produced by ligating together the 
overlapping regions of DM415 and DM416 (Example 7) . The spleens were 
removed three days later. 

Spleen cells were fused with NSo myeloma cells in the presence of 
PEG1500 by standard techniques. The resulting hybrldoma cells were 
selected by growth in HAT (hypoxanthine , aminopterin. thymidine) 
medium. At 10-14 days post-fusion, supematants were screened for 
anti-PT-HANBH activity by EUSA. Wells which showed reactivity with 
bothmi3 and BHC-7 antigens (Example 10) were Identified and 
individual colonies were transferred to separate wells, grown and 
re-tested. Wells, which showed specific reactivity at this stage were 
further cloned at limiting dilution to ensure monoclonality. 

BYAMPl.E 13. py-f^r.«.io« Of ^.WAWRH Vir>^1 TTticlelc Ac1<^ fp Seyppp^jtiye 
Patients 

Sera: Donation samples from 1400 donors, enrolled into . a prospective 
study of post- transfusion, hepatitis, were frozen at -20 C. 
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Pre. transfusion and serial post- transfusion samples from tiie 260 
recipients were similarly stored. The post-transfusion samples vsre 
collected fortnightly until 3 months, monthly until 6 months and 5 
monthly thereafter, until 18 months. Frozen donor and recipient sera 
from three incidents of PT-NANBH that occurred in 1981 were also 
available for study. The diagnosis of PT-NANBH was based on a rise in 
serum alanine amino transferase (ALT) to exceed 2.5 times the upper 
limit of normal in at least two separate post -trans fusion samples. 
Other hepatotroplc viruses were excluded by serological testing and 
non-viral causes of hepatocellular injury were excluded by 
conventional clinical and laboratory studies. 

Immunoassay: Serum samples were tested retrospectively for the 
presence of antibodies to HCV (ClOO antigen) with the Ortho 
Diagnostics EUSA kit used in accordance with the manufacturer's 
instructions. Repeatedly reactive sera were titrated to end points in 
a human serum negative for anti-ClOO. 

Detection of PT-NASBH Viral Sequences: Serum or plasma SNA was 
extracted, reverse transcribed, and amplified as described below. The 
reverse transcription/PCR oligonucleotide primers were derived from 
the nucleotide sequence of the JG2 clone isolated in EXAMPLE 3. and 
synthesised on an Applied Biosystems 381A synthesiser. The sequences 
of the four oligonucleotide primers were as follows: 



pP-Bipiation 



d94 sense 



d95 antisense 



^vn TP HP : fr^duct Sl?e 



729bp 



Nl sense 
N2 antisense li 



10 

402bp 
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(i) RHA Extraction 



5.50^1 of serum (or plasma) was made up to 200^1 by addixxg ■ 
sterile distilled water. The 200pl sample was added to an equal 
volume of 2 X PK hv£tex (2 x PK - 0. 2M.TrisCl. pH7.5. 25idM EDTA. 
0.3M NaCl. 2% w/v SDS. proteinase K 200MS/ml), mixed and 
incubated at 37°C for 40 minutes. Proteins were removed by 
extracting twice with phenol/chloroform and once with chloroform 
alone. 20pg glycogen were added to the aqueous phase and the RNA 
then precipitated by addition of 3 volumes of ice-cold absolute 
ethanol. After storage at .70°C for 1 hour the RNA was^ pelleted 
in an Eppendorf centrifuge <15 minutes, 14000 rpm, 4 C). The 
pellet was washed once in 95% ethanol, vacuum desiccated and 
dissolved in lOpl of sterile distilled water. BHA solutions were 
stored at -70°C. 

(ii) cDNA Synthesis 

A lOMl mixture was prepared containing 2^1 of the RNA solution. 
50ng of the synthetic oligonucleotide d95. lOnM Hepes-HCl pH6.9 
and 0.2mM EDTA PH8.0. This 10^1 mix was overlayed with 2^ drops 
of mineral oil. heated for 2 minutes in a water bath at 90 C and 
cooled rapidly on ice. cDNA synthesis was performed . after 
adjusting the reaction to contain 50mM Tris-HCl pH7.5. 75mM KCl. 
3mM MgCl,, lOmM DTT. 0.5m« each of dATP. dCTP. dGXP and dTTP. 20 
units of RNase inhibitor (Pharmacia) and 15 units of cloned MLV 
reverse transcriptase (Pharmacia) in a final volume of ZO^l. The 
20pl mix was incubated at 37°C for 90 minutes. Following 
synthesis the cDNA was stored at -20 C. 

(iii) "Nested" PGR 

Throughout this study false positive PGR results were avoided by 
strict application of the contamination avoidance measures of 
Kwok and Higuchi (Nature . 1989, 139, 237-238). 
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a) Round 1 

The polymerase chain reaction was performed in a 50;il mix 
containing lOmM Tris-HCl pH8.3. 50mM KCl, 1.5nM MgCl^, 0.01% 
w/v gelatin, 1 Unit Recombinant Taq DNA polymerase (Perkln 
Elmer Cetus) , 200,iM each dNTP, 30ng of each 'outer' primer 
(d94 and d95; SEQ ID HO : 8 and 9 respectively) and 5^1 of 
the cDNA solution. After an initial 5 minute denaturation 
at 94°C. 35 cycles of 95°C for 1.2 minutes. U'^C for 1 
minute, 72°C for 1 minute were carried out, followed by a 
final 7 minute extension at 72°C (Techne PHC-1 Automated 
Thermal Cycler). 

b) Round 2 

The reaction mix was as described above for Round 1 but 
. 125ns of each 'inner' primer. HI and N2 (SEQ ID NO : 10 and 
11 respectively), was used instead of the 'outer' primers 
d94andd95. A Ipl aliquot of the Round 1 PGR products was 
transferred to the Round 2 50^1 reaction mix. 25 cycles of 
95°C for 1.2 minutes. 46*^0 for 1 minute. 72°C for 1 ^minute 
were performed followed by a 7 minute extension at 72 C. 
c) Analysis 

20,il of the Round 1 and Round 2 PGR products were analysed 
• by electrophoresis on a 2% agarose gel. Bands were 
visualised by ethidium bromide staining and photographed at 
302nm. 

Predictive Value of Anti-HGV Serology and PGR in the Prospective 
Study: Six of the 1400 donors (0.43%) enrolled into the prospective 
study were found to have antibodies to ClOO in their serum. Of these 
six antibody positive donors only one (donor D6) proved to be 
infectious as judged by the development of PT-NANBH and GlOO 
seroconversion in a recipient (recipient R6) - see Table 6 below. 
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Viral sequences were detected by BCR in the serum of donor D6 but not 
in any of the other five seropositive donor sera. The recipient U 
who developed PT-NANBH had also received blood from seven other donors 
(D7 to D13). Sera from these donors were tested and found to be both 
antibody negative and PGR negative. 

TABLE 6 

nOROR/RECCTTEHT DATA gUMMARY t ER QSPECTXVE STOIg 



PONORS 



RgCIPIEOTS 



Donor 



anti-HGV PGR 



Recipient 



PT-NANBH Anti-HCV 
serocon- 
version 



Dl 

D2 

D3 

D4 

D5 

D6 

D7 

D8 

D9 

DIO 

Dll 

D12 

D13 



+ 

+ 



Rl 
R2 
R3 
R4 
R5 



R6 



No 
No 
No 
No 
No 



Yes* 



No 
No 
No 
No 
No 



Yes+ 



* incubation period 1 month 



+ Seroconversion occurred at 5 months post- transfusion 
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pnamole 14 - 

The lambda gtU libraries prepared in Example 2 were also screened 
with sera from patients with a high risk for PT-KANBH but which dxd 
not react with the viral antigens, DX113. BHCS and BHC-7. the 
reasoning being that they might well contain antibodies whxch 
recognise different antigens. The sera. PJ-S (The Newcastle Royal 
infirmary. Newcastle). Birm-64 (Queen Elizabeth Medical Centre, 
Binninghan.). PG and Le (University College and Middlesex School of 
Medicine. London) met this criterion and were used to screen the 
libraries following the same procedure as described in Examples 3 and 
4 A number of " recombinants were thus Identified, none of whach 
cross-hybridised with probes made from JG2 and JG3 . One of the 
recombinants. BRU. identified by reaction with PJ- 5. was selected for 
further analysis. 

The clone. BRll. contained an insert of approximately 900bp which was 
amplified by PGR using the d75 and d76 primers (SEQ ID NO : 6 and 7) 
as described in Example 7. The amplified sequence was directly cloned 
into the baculovirus vector pAc360 to form pDX128 containing an open 
reading frame in phase with the first 11 amino acids of polyhedrin 
Recombinant baculovirus stocks (designated BHC-9) were produced 
following the procedure described in Example 7. Insect cells were 
infected with purified recombinant virus and a polypeptide of 
approximately 22kl> was obtained in radiolabelled cell extracts. 

The amplified insert of BRll was also cloned into pUCl3 and M13 phage 
vector for sequencing; the DNA and aminoacid sequence data are 
presented in SEQ ID NO : 5. The insert contains 834bp plus the EcoRI 
liTikers added during cloning. 



MJS/AC/12th December 1990 



- 40 - 



PA1121 



jCf^^m plA IS . PBrforma-nee of BHC-9 Po1vp<>pi;lrlft In flU MSA 

An EUSA was established using microtltre wells coated with 
BHC-9-ln£ect cell extract and an antl -human Ig conjugate detection 
system following the procedure as described in Example 10. A panel of 
high-risk sera were assayed in parallel against BHC-7 and BHC-9 and 
were also examined by PGR using the procedure described in Example 13. 
The results are shown in Table 7 in which positive samples are 
londerlined. 



ttumber 


PGR 


1 


+ 


2 




3 


+ 


4 


+ 


5 


+ 


6 




7 




8 


+ 


9 




10 




11 


+ 


12 




13 




14 




15 


+ 


16 




17 




18 




19 




20 




cut-off 







BHC'9 






2.09 




1,89 




1.57 


0.27 




3L02. 


0.91 


2.00 


0.90 


0^ 




1.19 




0.43 






0.37 




0.32 


2.00 


0.23 


0.30 


0.15 




0.16 


p. 76 


0.09 


1.74 


0.27 




0.15 




0.12 




0.08 


6.05 


0.27 


0.29 
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Of these 20 samples. 50% are clearly positive with BHC-7 whereas 85% 
are positive with BHC-9. Two samples (11 & 12) which are borderline 
positive with BHC-7 are clearly positive with BHC-9 and some of the 
samples at or below the cut off with BHC? are positive with BHC-9. 
In addition, two samples (11 6. 15) which were borderline or negative 
with BHC-7 but positive with BHC-9 are PCR-positive . 
Overall there are only two samples (13 & 20) which are negative with 
both polypeptides and PCR. 

Example 16 - 

TsoTattoti nf PT-UAWBtf DWA aeoue n '^ea overlaiTPinE exiStiPS QlV^eg 

The immunological screening of cDNA expression libraries described in 
Examples 3.4 and 14. can only Identify those clones which contain an 
immunoreactive region of the virus. Another approach to the 
production of clones specific for PT-NAHBH is to use PCR to amplify 
cDNA molecules which overlap the existing clones. Sets of primers can 
be prepared where one member of the pair lies within existing cloned 
sequences and the other lies outside; this approach can be extended to 
nested pairs of primers as well. 

cDNA, prepared as described in Example 1. was amplified by PCR, with 
either single or nested pairs of primers, using the reaction 
conditions described in Example 13. The approach is illustrated by 
use of the followiTig pairs of primers; dl64 (SEQ ID NO : 12) and dl37 
(SEQ ID NO : 13); dl36 (SEQ ID NO : 14) and dl55 (SEQ ID NO : 15); 
dl56 (SEQ ID NO : 16) and d92 (SEQ ID NO : 17) . One member of each 
pair is designed to prime within existing cloned sequences (dl37 and 
dl36 prime within the 5' and 3' ends of BRll respeccively . d92 primes 
at the 5' end of JG3). The other primers are based upon sequences 
available for other PT-NANBH agents. Primer dl64 corresponds to bases 
10 to 31 of figure 2 in Okamoto si al. Japan. JL. BSSS^ Mgd^- 1990, £0 
167-177. Primers dl55 and dl56 correspond to positions 462 to 489 and 
3315 to 3337 respectively in figure 47 of European Patent Application 
88310922.5. One or more nucleotide substitutions were made to 
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Introduce an EcoRl recognition site near the 5' end of the primers, 
except for dl64 where a Bgl2 recognition site was introduced; these 
changes facilitate the subsequent cloning of the amplified product. 

The PCR products were digested with the. appropriate restriction 
enzyjae(s) , resolved by agarose gel electrophoresis and bands of the 
ejcpected size were excised and cloned Into both plasmtd and 
bacteriophage vectors as described In Example 5. The sequences of the 
anq)lified DNAs 164/137 (SEQ ID NO : 18), 136/155 (SEQ ID NO : 19) and 
156/92 <SEQ ID NO : 20) are presented in the Sequence Listing. These 
new sequences extend the coverage of the ET-NANBH genome over that 
obtained by imraunoscreening (SEQ ID NO : 3. 4 & 5) . These sequences, 
together with others which lie within the regions already described, 
can be eo»bined into a contiguous sequence at the 5' end (SEQ ID NO ; 
21) and at the 3' -end (SEQ ID NO : 22) of the PT-NANBH genome. 

Example 17 

y„«ton of D j ff^T-^nt: PT-NANBH Antjgens into a ?^7ip1e Rgcon-V^pa'^t 
polypeptide 

•ae data presented in Table 7 indicate that whilst more serum samples 
are detected as antibody-positive using BHC-9 as a target antigen 
(17/20) rather than BHC.7 (10/20) there are some samples (e.g. #4) 
which are positive witii only BHC-7. This picture is borne out by 
wider testing of samples. Accordingly, a fusion construct was derived 
using sequence from BHC-7 and BHC-9. 

Sequences from BHC-7 and BHC-9 may be combined In a variety of ways; 
either sequence may be positioned at the amino terminus of the 
resulting fusion and the nature of the linking sequence may also be 
varied. Figure 2 Illustrates two possible ways in which the sequences 
may be combined. 

Appropriate restriction fragments carrying suitable restriction enzyme 
sites and linker sequences were generated either by PCR using specific 
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primers or by restriction enzyme digestion of existing plaismids. The 
transfer vector DXIM consists of a BaioHl/Pstl fragment from DX122 
(Figure 1; the Pst site is at position 1504 JG2, SEQ ID NO: 3) linked 
to the 5* end of the entire coding region of BRll (SEQ ID NO: 7) which 
has been amplified as a Pstl/BaniHl fragment using primers d24 (SEQ ID 
NO:23) and dl26 (SEQ ID 110:24); the linkage region consists of six 
amino acids derived from the dl26 primer and residual bacteriophage 
laoibda sequences. The transfer vector DK136 differs from DX143 in 
that the BRll fragment was generated using d24 (SEQ ID NO : 23) and 
dl32 (SEQ ID NO : 25) and so the linkage region contains five lysines. 
These transfer vectors were used to co-transfect Sf9 insect cells in 
culture with AcNPV DNA and plaque purified stocks of recombinant 
baculoviruses were produced as described in Example 7. BHC-10 was 
produced as a result of transfectlon with DX143; BHC-11 as a result of 
transfection with DX136. 

The recombinant polypeptides expressed by these two viruses were 
analysed by SDS-PAGE and western blotting. BHC-10 produced a 
polypeptide with an apparent molecular weight of USkDa. BHC-11 
produced a polypeptide with an apparent molecular weight of 96kDa. 
Both polypeptides reacted with sera known to react in ELISA only with 
BHC.7 (e.g. serum A) or only with BHC-9 (serum B64, Example 14). The 
two polypeptides only differ in the linker sequence and this may 
affect either their mobility on SDS-PAGE or how they are processed in 
the infected cells. 

Example 18 - 

Performance of PT-NARBH Fusi on Antigens in an ElflSA 

An ELISA was established using microtitre wells coated with 
BHC-9-infected cell extracts and an anti-human Ig conjugate following 
the procedure described in Example 10. Table 8 presents the data from 
a comparison of the two fusions with the other PT-NANBH recombinant 
antigens BHC-7 and BHC.9 as well as the HCV recombinant protein 
C-100-3 (Ortho Diagnostic Systems, Raritan, New Jersey). The sera are 
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grouped by pattern of reaction with BHC-7. BHC-9 and C-100-3. Group 1 
sera react strongly with all three antigens; Group II react strongly 
with only BHC-7; Group III react strongly with only BHC.9 and Group IV 
react strongly with only two out of the three antigens. 

SERUM BHC-7 BHC-9 C-100-3 BHC-10 BHC-U 



Group I 



AH 


>2.0 


>2.0 


>2.0 


>2.0 


>2.0 


AC 


>2.0 


>2.0 


>2.0 


>2.0 


>2.0 


57 


>2.0 


>2.0 


>2.0 


>2.0 


>2.0 


77 


>2.0 


>2.0 


>2.0 


>2.0 


>2.0 


84 


1.4 


>2.0 


>2.0 


>2.0 


>2.0 



Group II 
805-6 
805-17 
805-149 



>2.0 
>2.0 
>2.0 



0.261 
0.181 
0.651 



0.1 


1.78 


+ 






* 


0.12 


1.37 


+ 






* 


0.084 


1.57 


++ 



Group 111 
JS 


0.32 


>2.0 


0.17 


>2.0 


>2.0 
* 


805-57 


0.069 


1.403 


0.25 


1.9 


+ 

* 


805-82 


0.116 


1-272 


0.4 


1.85 


++ 
* 


805-94 


0.353 


1.675 


0.2 


>2.0 


+ 


PJl 


0.27 


>2.0 


0.2 


>2.0 


1.85 


A 


>2.0 


0.14 


>2.0 


>2.0 


>2.0 


KI 


1.57 


0.27 


>2.0 


>2.0 


>2.0 


Le 


0.152 


>2.0 


>2.0 


>2.0 


>2.0 


PJ5 


0.123 


>2.0 


>2.0 


>2.0 


>2.0 
* 


303-923 


>2.0 


0.9 


0.37 


1.9 


+ 

* 


303-939 


>2.0 


1.55 


0.268 


2.0 


+ 
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* These saniples have only been tested by western blotting on BHC-11. 

These data show that both BHC-10 and BHC-11 have a similar reactivity 
with these sera and, most iniportantly , that the both antigenic 
activities appear to have been retained by the fusions. All the sera 
in Groups II & III, which react with only BHC-7 or BHC-9 respectively, 
give a clear reaction with the fusions. Additionally there is an 
indication that having the two antigens together gives a more 
sensitive assay. For example the sample KT gives ODs of 1,57 and 0,27 
with BHG-7 and BHG-9 respectively vhereas with the fusions the OD is 
>2.0. 
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SEQ ID R0:1 

SEqaENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 21 BASES 

STRANDEDNESS : s ingle 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ORIGIHAL SOURCE ORGANISHrbacteriophage lambda gtll 

IMMEDIATE EXPERIMENTAL SOURCE: Oligonucleotide synthesiser; oligo dl9 

FEATURES : 

from 1 to 21 bases homologous to upstream portion of lacZ gene 
flanking the EcoRl site in bacteriophage lambda gtU 

PROPERTIES: primes DNA synthesis from the phage vector into cDNA 
inserted at the EcoRl site. 

GGTGGGGACG ACTCCTGGAG C 21 
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SEQ ID K0:2 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 21 BASES 



STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGANISM: bacteriophage lambda gtll 

IMMEDIATE EXPERIMENTAL SOURCE: 01 igomcleotlde synthesiser; oligo d20 
FEATURES : 

from 1 to 21 bases homologous to downstream portion of lacZ gene 
flanking the EcoRl site in bacteriophage lambda gtll 

PROPERTIES: primes DNA synthesis from the phage vector into cDNA 
inserted at the EcoRl site. 



21 

TTGACACCAG ACCAACTGGT A 
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SEQ ID N0:3 

SEQUENCE TYPE: Nucleotide with corresponding protein 
SEQUENCE LENGTH: 1770 BASE PAIRS 

STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPB:cBNA to genomic RNA 

ORIGINAL SOURCE ORGAJ^ISH:hiiman; serxun 
post-transfuslonal non-A, non-B hepatitis 
IMblEDIATE EXPERIMENTAL SOURCE: clone JG2 from cDNA 
gtll 



infectious for 
library in lambda 



FEATURES: 

from 1 to 1770 bp portion of the PT-NANBH polyprotein 
PROPERTIES: probably encodes viral non- structural proteins 



CAA AAT GAG TTC CCA GAG 6CT GAC CTC ATC GAG GCC AAC CTC CTG TGG 
Gin Asn Asp Phe Pro Asp Ala Asp Leu He Glu Ala Asn Leu Leu Trp 
5 10 ^5 



48 



CGG CAT GAG ATG GGC GGG GAC ATT ACC CGC GTG GAG TCA GAG AAC AAG 
Arg His Glu Met Gly Gly Asp He Thr Arg Val Glu Ser Glu Asn Lys 
20 25 30 



96 



CTA GTA ATC CTG GAC TCT TTC GAC CCG CTC CCA GCG GAG GAG GAT GAG 
Val Val He Leu Asp Ser Phe Asp Pro Leu Arg Ala Glu Glu Asp Glu 
35 40 ^3 



144 



CGG GAA GTG TCC GTC CCG GCG GAG ATC CTG CGG AAA TCC AAG AAA TTC 
Arg Glu Val Ser Val Pro Ala Glu He Leu Arg Lys Ser Lys Lys Phe 
50 55 60 



192 
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CCA CCA GCG ATG CCC GCA TGG GCA CGC CCG GAT TAG AAC OCT CCG CTG 
Pro Pro Ala Het Pro Ala Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu 
65 70 75 80 



240 



CTG GAG TCC TGG AAG GCC CCG GAC TAG GTC CCT pCA GTG GTA CAT GGG 
Leu Glu Ser Trp Lys Ala Pro Asp Tyr Val Pro Pro Val Val His Gly 
85 90 95 



288 



TGG CCA CTG CCA CCT ACT AAG ACC CCT CCT ATA CCA CCT CCA CGG AGA 
Cys Pro Leu Pro Pro Thr Lys Thr Pro Pro He Pro Pro Pro Arg Arg 
100 105 . 110 



336 



AAO AGG ACA GTT GTT CTG ACA GAA TCC ACC GTG TCT TCT GCC CTG GCG 
Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Ala Leu Ala 
115 120 125 



384 



GAG CTT GCC ACA AAG GOT TTT GGT AGC TCC GGA CCG TCG GCC GTC GAC 
Glu Leu Ala Thr Lys Ala Phe Gly Ser Ser Gly Pro Ser Ala Val Asp 
130 135 1*0 



432 



AGC GGC ACG GCA ACC GCC CCT CCT GAC CAA TCC TCC GAC GAC G6C GGA 
Ser Gly Thr Ala Thr Ala Pro Pro Asp Gin Ser Ser Asp Asp Gly Gly 
145 150 155 160 

GCA GGA TCT GAC GTT GAG TCG TAT TCC TCC ATG CCC CCC CTT GAG GGG 
Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Het Pro Pro Leu Glu Gly 
165 170 175 



480 



528 



GAG CCG GGG GAC CCC GAT CTC AGC GAC GGG TCT TGG TCT ACC GTG ACT 
Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser 
180 185 190 



576 



GAG GAG GCC GGT GAG GAC GTC GTC TGC TGC TCG ATG TCC TAG ACA TGG 
Glu Glu Ala Gly Glu Asp Val Val Cys Cys, Ser Met Ser Tyr Thr Trp 
195 200 205 



624 
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ACA GGC GOT CTG ATC ACG CCA TGC OCT GCG GAG GAA AGC AAG OTG CCC 
Thr Gly Ala Uu lie Thr Pro Cys Ala Ala Glu Glu Ser Lys Leu Pro 
210 215 220 

ATC AAC GCG TTG AGC AAC TCT TTG CTG CGT CAC PAC AAC ATG GTC TAG 
He Asn Ala Uu Ser Asn Ser Leu Leu Arg His His Asn Ket Val Tyr 
225 230 235 240 

GOT ACC ACA TCC CGC AGC GCA AGC CAG CGG CAG AAG AAG GTC AGC TTT 
Ala Thr Thr Ser Arg Ser Ala Ser Gin Arg Gin Lys Lys Val Thr Phe 
245 250 255 

GAG AGA CTG CAA ATC CTG GAG GAT CAC TAG GAG GAC GTG CTC AAG GAG 
Asp Arg Uu Gin He Leu Asp Asp His Tyr Gin Asp Val Uu Lys Glu 
260 263 270 

ATG AAG GCG AAC GCG TCC ACA GTT AAG GCT AAG CTT CTA TCA GTA GAG 
Met Lys Ala Lys Ala Ser Thr Val Lys Ala Lys Leu Uu Ser Val Glu 
275 280 285 

GAA GCC TGC AAG CTG ACG CCC CCA CAT TCG GCC AAA TCT AAA TTT GGC 
Glu Ala Cys Lys Leu Thr Pro Pro Hxa Ser Ala Lys Ser Lys Phe Gly 
290 295 300 

TAT 6GG GCA AAG GAC GTC CGG AAC GTA TCC AGC AAG GCC ATT AAC CAC 
Tyr Gly Ala Lys Asp Val Arg Asn Uu Ser Ser Lys Ala He Asn His 
305 310 . 315 320 

ATC CGC TCC GTG TOG GAG GAC TTG TTG GAA GAC ACT GAA ACA CCA ATT 
He Arg Ser Val Trp Glu Asp Uu Uu Glu Asp Thr Glu Thr Pro He 
325 330 335 



672 



720 



768 



816 



864 



912 



960 



1008 



GAC ACC ACC ATC ATG GCA AAA AAT GAG GTT TTC TGC GTC CAA CCA GAG 1056 
Asp Thr Thr He Het Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu 
340 345 350 



MJS/AC/12th December 1990 



. 51 - 



PA1121 



AGA GGA GGC CGC AAG CCA GCT CGC CTT ATC GTG TTC CCA GAC TTG GGG 
Arg Gly Gly Arg Lys Pro Ala Arg Leu He Val Phe Pro Asp Leu Gly 
355 360 365 



1104 



GTC CGT GTG TGC GAG AAA ATG GCC CTC TAT GAC GTG GTC TCC ACC CTC 
Val Arg Val Cy« Olu Lys Het Ala Leu Tyr Asp Val Val Ser Tbr Leu 
370 375 380 

COT CAG GCT GTG ATG GGC TCC TCG TAG GGA TTC CAG TAT TCT CCT GGA 
Pro Gin Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly 
385 390 395 400 

CAG CGG GTC GAG TTC GTG GTG AAC GCC TCG AAA TCA AAG AAG ACC CCT 
Gin Arg Val Glu Phe Leu Val Asn Ala Trp Lys Ser Lys Lys Thr Pro 
405 410 41-5 



1152 



1200 



1248 



ATG GGC TTT GCA TAT GAC ACC CGC TGT TTT GAC TCA ACA GTC ACT GAG 
Met Gly Phe Ala T^r Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 
420 425 



1296 



AAT GAC ATC CGT GTA GAG GAG TCA ATT TAT CAA TGT TGT GAC TTG GCC 
Asn Asp He Arg Val Glu Glu Ser He T^r Gin Cys Cys Asp Leu Ala 
435 440 W5 



1344 



CCC GAA GCC AGA CAG GCC ATA AGG TCG CTC ACA GAG CGG CTT TAT ATC 
Pro Glu' Ala Arg Gin Ala He Arg Ser Leu Thr Glu Arg Leu Tyr He 
450 455 *60 



1392 



GGG GGT CCC CTG ACT AAT TCA AAA GGG GAG AAC TGC GGC TAT CGC CGG 
Gly Gly Pro Uu Thr Asn Ser Lys Gly Gin Asn Cys Gly Tyr Arg Arg 
465 470 475 480 



1440 



TGC CGC GGG AGC GGC GTG CTG ACG ACT AGG TGC GGT AAT ACC CTC ACA 
Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr 
485 490 495 



1488 
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TGT TAC TIG AAG GCC TCT GCA CCC TGT CGA GOT GCA AAG CTC GAG GAC 1536 
Cys lyr Leu Lys Ala Ser Ala Ala Cys Arg Ala Ala lys Leu Gin Asp 
500 505 510 

TOC ACG ATG CTC GTG TGC GGA GAC GGC CTT GTC GTT ATC TGT GAG AGC 1584 
Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu Ser 
515 520 525 

GCG GGA ACC CAG GAG GAC GCG GCG AGC CTA CGA GTC TTC ACG GAG GGT 1632 
Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu Arg Val Pbe Thr Glu Ala 
530 535 540 

ATG ACT AGG TAC TCT GCC CCC CCC GGG GAC CCG CCC CAA CCA GAA TAC 1680 
Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr 
545 550 555 560 

GAC CTG GAG TTG AIA ACA TCA TGC TCC TCC AAT GTG TGG GTC GCG GAC 1728 
Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala His 
565 570 575 

GAT GCA TCT GGC AAA AGG GTA TAC TAC CTC ACC CGT GAC CCG 1770 
Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro 
580 585 590 
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SEQ ID N0:4 

SEQUENCE TYPE:Nucleotide with corresponding protein 
SEQUENCE LENGTH : 1035 BASE PAIRS 



STBANDEDNESS : single 
TOPOLOCY: linear 

MOLECULE TYPE'.cDNA to genomic RNA 

ORIGINAL SOURCE ORGANISM: human; serum infectious for 
post-transfusional non-A, non«B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE: clone JG3 from cDNA library in lambda 
gtU 



FEATURES: 

from 1 to 1035 bp portion of the PT-NAHBH polyprotein 
PROPERTIES: probably encodes viral non- structural proteins 



ACA GAA GTG GAT GGG GTG CGG CTG CAC AGG TAG GCT CCG GCG TGC AAA 
Thr Glu Val Asp Gly Val Arg Leu His Arg Tyr Ala Pro Ala Cys Lys 
5 10 15 



48 



OCT CTC CTA CGG GAG GAG GTC ACA TTC CAG GTC GGG CTC AAC CAA TAG 
Pro Leu Leu Arg Glu Glu Val Thr Phe Gin Val Gly Leu Asn Gin Tyr 
20 25 30 



96 



CTG GTT GGG TCG CAG CTC CCA TGC GAG CGC GAA CCG GAT GTA GCA GTG 
Leu Val Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val 
35 40 ^5 



144 



CTC ACT TCC ATG CTC ACC GAC CCG TCC CAC ATC ACA GCA GAG ACG GCT 
Leu Thr Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Thr Ala 
50 55 60 



192 
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AAG CGC AGG CTC GCC AGG GGG XCT CCC CCC ICC TTG GCC AGC TOT TCA 2A0 
Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser 
65 70 75 80 



GCT AGG CAG TTG TCT GGC OCT TCC TCG AAG GGG ACA lAC ATT AGC CAA 288 
Ala Ser Gin Leu Ser Gly Pro Ser Ser Lys Ala Thr Tyr He Thr Gin 
85 90 95 



AAT GAC TTC CCA GAG GCT 6AC CTC ATC GAG GCC AAC CTC CTG TGG CGG 336 
Asn Asp Fhe Pro Asp Ala Asp Leu lie Glu Ala Asn Leu Leu Trp Arg 
100 105 HO 

CAT GAG ATG GGC GGG GAC ATT ACC CGC GTG GAG TCA GAG AAC AAG GTA 384 
His Glu Het Gly Gly Asp He Thr Arg Val Glu Ser Glu Asn Lys Val 
115 120 125 

GTA ATC CTG GAC TCT TTC GAC CCG CTC CGA GCG GAG GAG GAT GAG CGG 432 
Val He Leu Asp Ser Fhe Asp Pro I^u Arg Ala Glu Glu Asp Glu Arg 
130 135 140 

GAA GTG TCC GTC CCG GCG GAG ATC CTG CGG AAA TCC AAG AAA TTC CGA 480 
Glu Val Ser Val Pro Ala Glu He Leu Arg Lys Ser Lys Lys Fhe Pro 
145 150 ,155 160 

CGA GCG ATG CCC GCA TGG GCA CGC CCG GAT TAG AAC COT CCG CTG CTG 528 
Fro Ala Met Pro Ala Trp Ala Arg Pro Asp Tyr Asn Fro Pro Leu Leu 
165 170 175 



GAG TCC TGG AAG GCC CCG GAC TAG GTC GCT CCA GTG GTA CAT GGG TGC 576 
Glu Ser Trp Lys Ala Pro Asp Tyr Val Pro Pro Val Val His Gly Cys 
180 185 190 

CCA CTG CCA GCT ACT AAG ACC CCT GCT ATA CCA CCT CCA CGG AGA AAG 624 
Pro Leu Pro Pro Thr Lys Thr Pro Pro He Pro Pro Pro Arg Arg Lys 
195 200 205 
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AGG ACA GTT GTT CTG ACA GAA TCC ACG GTG TCT TCT GCC CTG GCG GAG 
Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Ala Leu Ala Glu 
210 215 220 

CTT GCC ACA AAG OCT TTT GGT AGO TCC GGA CCG TCG GCC CTC GAC AGO 
Leu Ala Thr Lys Ala Phe Gly Ser Ser Gly Pro Ser Ala Val Asp Ser 
225 230 23S 240 

GGC ACG GCA ACC GCC CCT OCT GAC CAA TCC TCC GAC GAC GGC GGA GCA 
Gly Thr Ala Thr Ala Pro Pro Asp Gin Ser Ser Asp Asp Gly Gly Ala 
245 250 255 



672 



720 



768 



GGA TCT GAC GTT GAG TCG TAT TCC TCC ATG CCC GCC CTX GAG GGG GAG 
Gly Ser Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu 
260 265 270 



816 



CCG GGG GAC CCC GAT CTC AGC GAC GGG TCT TGG TCT ACC GTG ACT GAG 
Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Glu 
275 280 285 

GAG GCC GGT GAG GAC GTC GTC TGC TGC TCG ATG TCC TAC ACA TGG ACA 
Glu Ala Gly Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp Thr 
290 295 300 

GGC OCT. CTG ATC ACG CCA TGC GCT GCG GAG GAA AGC AAG CTG CCC ATC 
Gly Ala Leu lie Thr Pro Cys Ala Ala Glu Glu Ser Lys Leu Pro lie 
305 310 315 320 

AAC GCG TTG AGC AAC TCT TTG CTG CGT CAC CAC AAC ATG GTC TAC GCT 
Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Met Val Tyr Ala 
325 330 335 



864 



912 



960 



1008 



ACC ACA TCC CGC AGC GCA AGC CAG CGG 
Thr Thr Ser Arg Ser Ala Ser Gin Arg 
340 345 



1035 
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SEQ ID NO: 5 

SEQUENCE TYPE: Nucleotide vitb corresponding protein 
SEQUENCE LENGTH: 834 BASE PAIRS 

STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPEicDNA to genomic RNA 

ORIGINAL SOURCE ORGANISH;hiinian; serum infectious for 
post-transfusional non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE: clone BRll from cDNA library in lambda 
gtll 

FEATORES: 

froia 1 to S34 bp portion of the PT-NANBH polyprotein 
PROPERTIES: probably encodes viral structural proteins 



AGA AAA ACC AAA CGT AAC ACG AAC CTC CGC CCA GAG GAC GTC AGO TTC 
Arg Lys Thr Lys Arg Asn Thr Asn Leu Arg Pro Gin Asp Val Arg Fhe 
5 10 15 



48 



CCG GGC GGT GGT CA6 ATC GTT GGT GGA GTT TAG CTG TTG CCG CGC AGG 
Pro Gly Gly Gly Gin He Val Gly Gly Val Tyr Leu Leu Pro Arg Arg 
20 25 30 



96 



GGC CCC AGG TTG GGT GTG CGC GCG ACT AGG AAG ACT TCC GAG CGG TCG 
Gly Fro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser 
35 40 A5 



144 



CAA OCT CGT GGA AGG CGA CAA CCT ATC CCC AAG GCT CGC GAG CCC GAG 
Gin Bro Arg Gly Arg Arg Gin Pro He Pro Lys Ala Arg Gin Pro Glu 
50 55 60 



192 
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GGC AGG GCC TGG GOT GAG CGC GGG TAC OCT TGG CCC CTC TAT GGC AAC 
Gly Arg Ala Trp Ala Gin Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Astx 
65 70 75 80 

GAG GGC AIG GGG TGG GCA GGA TGG CTC CTG TCA CCC CGT GGC TCC GGG 
Glu Gly Met Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg 
85 90 ^5 

CCT ACT TGG GGG GCC ACT GAC CCC CGG CGT AGG TCG CGT AAT TTG GGT 
Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly 
100 105 110 

AAA GTC ATC GAT ACC CTC ACA TGC GGC TTC GCC GAC TCT CAT GGG GTA 
Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Ser His Gly Val 
115 120 125 



240 



288 



336 



384 



CAT TCC GCT CGT CGG CGC TCC CTT AGG GGC GCT GCC AGG GCC CTG GCG 
His Ser Ala Arg Arg Arg Ser Leu Arg Gly Ala Ala Arg Ala Leu Ala 
130 135 1^0 

CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA ACA GGG AAT 
His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn 
145 150. 155 



432 



160 



430 



TTA CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG CTG TCC TGT 
Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
165 170 175 



528 



TTG ACC ATT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG ATC 
Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly He 
180 185 190 

TAC CAT GTC ACG AAC GAT TGG TCC AAC TCA AGC ATC GTG TAC GAG ACA 
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Thr 
195 200 ^05 



576 



624 
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GGG GAG ATG ATC ATG CAC ACC CCC GGG TGT GTG COG TGT GTC CGG GAG 
Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
210 215 220 



672 



GGT AAT TOO TCC CGC TGC TGG GTA GC6 CTC ACT CCC ACG CTC GCG GCC 
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
225 230 235 240 



720 



AAG GAC GCC AGO ATC CCC ACT GCG ACA ATA CGA CGC CAC GTC GAT TTG 
Lys Asp Ala Ser He Pro Thr Ala Thr lie Arg Arg His Val Asp Leu 
245 250 255 



768 



CTC GTT GGG GCG GCT GCC TTC TCG TCC GOT ATG TAG GTG GGG GAT CTC 
Leu Val Gly Ala Ala Ala Phe Ser Ser Ala Met Tyr Val Gly Asp Leu 
260 265 270 



816 



TGC GGA TCT GTT TTC CCG 
Cys Gly Ser Val Phe Pro 
275 



834 
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SEQ ID N0:6 

SEQUENCE TYPE:Kucleotide 
SEQUENCE LENGTH: 31 BASES 

STRANDEDNESS ; single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGAKISM: bacteriophage lambda gtll 

IMMEDIATE EXPERIMENTAL SOURCE: Oligonucleotide synthesiser; oligo d75 
FEATURES: 

from 4 to 9 bases BamHl site 

froia 10 to 31 bases homologous to upstream portion of JLacZ gene 
flanking the EcoRl site in bacteriophage lambda gtll 
from 26 to 31 bases EcoRl site 

PROPERTIES: primes DNA synthesis from the phage vector into cDM 
inserted at the EcoRl site and introduces a BamHl site suitable for 
subsequent cloning into expression vectors. 

TAAGGATCCC CCGTCAGTAT CGGCGGAATT C 31 
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SEQ ID NO: 7 

SEQUENCE TYPE; Nucleotide 
SEQUENCE LENGTH: 30 BASES 

SXRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGANISH: bacteriophage lambda gtll 

IMfctEDIATE EXPERIMENTAL SOURCE: Oligonucleotide synthesiser; oligo d76 

FEATURES: 

from 4 to 9 bases BamHl site 

from 10 to 30 bases homologous to downstream portion of lac2 gene 
flanking the EcoRl site in bacteriophage lambda gtll 

PROPERTIES: primes DNA synthesis from the phage vector into cDNA 
inserted at the EcoRl site and introduces a BamHl site suitable for 
subsequent cloning into expression vectors, 

TATGGATCCG TAGCGACCGG CGCTCAGCTG 
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SEQ ID NO: 8 

SEQUENCE TYPE: Nucleotide 
SEQUSICE LENGTH: 19 BASES 

STRANDEDNES S : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DKA 

ORIGINAL SOURCE ORGANISK:hunian; serum infectious for 
post-transfusional non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo d94 
FEATURES : 

from 1 to 19 bases homologous to bases 91A to 932 of the sense strand 
of JG2 (SEQ ID NO : 3) 

PROPERTIES: primes DNA synthesis on the negative strand of PT-NANBH 
genomic RNA/DNA* 

ATGGGGCAAA GGACGTCCG 
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SEQ ID N0:9 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 24 BASES 

STRAl^EDNESS : single 

TOPOUOGY: linear 

KOLECULE TYPE: synthetic DM 

ORIGINAL SOURCE ORGANISM :htiinan; serum infectious for 
post^transfusional non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo d95 
FEATURES : 

from 1 to 24 bases boaologous to bases 1620 to 1643 of the antl- sense 
strand of JG2 (SEQ ID NO : 3) 

PROPERTIES: primes DNA synthesis on the positive strand of PT-NANBH 
genomic RNA/DNA* 



TACCTAGTCA TAGCCTCCGT GAAG 24 
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SEQ ID NO: 10 

SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 17 BASES 

STRANDEDNESS ! s irvglfi 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGANISM: hximan; seruza infectious for 
post*transfusional non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE ; oligonucleotide synthesiser; oligo Nl 
FEATURES : 

from 1 to 17 bases homologous to bases 1033 to 1049 of the sense 
strand of JG2 (SEQ ID NO ; 3) 

PROPERTIES: prices DNA synthesis on the negative strand of PT-NANBH 
genomic RNA/DKA, 



GAGGTTTTCT GCGTCCA 17 
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SEQ ID NOrll 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 17 BASES 

STRANDEDNESS : single 

TOPOLOGY; linear 

MOLECULE TYPE ;s3mthe tic DNA 

ORIGINAL SOURCE ORGANISM rhtisian; serum infectious for 
post-transfusional non-A, non-B hepatitis 

liWEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo N2 
FEATURES: 

fro© 1 to 17 bases homologous to bases 1421 to 1437 of the anti- sense 
strand of JG2 (SEQ ID NO : 3) 

PROPERTIES iprlmes DNA synthesis on the positive strand of PT-NANBH 
genomic RNA/DNA* 

GCGATAGCCG CAGTTGT 
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SEQ ID NO: 12 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 22 BASES 

STRANDEDNESS : single 

TOPOLOGY : lineax 

MOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGANI SK : human ; serum infectious for 
post-transfusional non-A, non^B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE : oligonucleotide synthesiser; ' oli go dl64 
FEATURES: 

from 1 to 22 bases homologous to bases 10 to 31 of the sequence in Fig 
2 of Okamoto ^ ri. JaT>an, J. Exr^ l^d,, 1990. 60 167^77, base 22 
changed from A to T to introduce Bsl2 recognition site 
from 8 to 13 bases Bgl2 recognition site 

PROPERTIES: primes DNA synthesis on the negative strand of PT-NASIBH 
genomic RNA/DNA and introduces a Bgl2 site. 



CCACCATAGA TCTCTCCCCT GT 
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SEQ ID K0:1S 

SEQUENCE TYPE:Nucleotide 
SEQUENCE LENGTH: 30 BASES 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGANISM; human; serum infectious for 
post-transfusional non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo dl37 
FEATURES : 

from 1 to 30 bases homologous to bases 154 to 183 of the negative 
strand of BRll (SEQ ID NO : 5); bases 174. 177 and 178 modified to 
introduce an EcoRl recognition site 
from 5 to 10 bases EcoRl recognition site 

PROPERTIES rprimes DNA synthesis on the positive strand of PT-NASBH 
genomic RNA/DNA and introduces an EcoRl site for cloning 



30 

GCGAGAATTC GGGATAGGTT GTCGCCTTCC 
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SEQ ID NO: 14 

SEQUENCE TYPE: nucleotide 
SECJUENCE LENGTH: 27 BASES 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGANISM: human; seriim infectious for 
post-transfusional non-A^ non-B hepatitis 

IMliEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo dl36 
FEATURES : 

from 1 to 27 bases homologous to bases 672 to 698 of the positive 
strand of BRll (SEQ ID NO : S); base 675 changed to G to introduce an 
EcoRl recognition site 
from 4 to 9 bases EcoRl recognition site 

PROPERTIES: primes DNA synthesis on the negative strand of PT-NANBH 
genomic RNA/DNA and introduces an EcoRl site for cloning 



GGGGAATTCC TCCCGCTGCT GGGTAGC 
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SEQ ID NO: 13 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 28 BASES 

STRANDEDNESS : single 

TOPOLOGY: linear 

HOLECULE TYPE: synthetic DNA 

ORIGINAL SOURCE ORGANISM: chimpanzee; serum infectious for 
post-transfusional non-A, non-B hepatitis 

IMilEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo dl55 
FEATURES: 

from 1 to 28 bases homologous to bases 462 to 489 of the negative 
strand of figure 47, European Patent Application 88310922.5; bases 483 
and 485 changed to introduce an EcoRl recognition site 
from 5 to 10 bases EcoRl recognition site 

PROPERTIES :primes DNA synthesis on the positive strand of PT-NANBH 
genomic RNA/DNA and introduces an EcoRl site for cloning 



AGGGGAATTC GACCAGGCAC CTCGGTGT 
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SEQ ID WO: 16 

SEQUENCE TYPE: Nucleotide 
SEQUENCE UENGTH:23 BASES 



STRANDEDNESS : single 

TOPOLOGY; linear 

MOLECDLE TYPE: synthetic DBA 

ORIGINAL SOURCE ORGANISM: chimpanzee; serum Infectious for 
post-transfusional non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo dl56 
fIBATURES : 

from 1 to 23 bases homologous to bases 3315 to 3337 of the positive 
strand of figure 47, European Patent Application 88310922.5; base 3323 
changed to C to introduce an EcoRl recognition site 
from 4 to 9 bases EcoRl recognition site 

PROPERTIES .-primes DKA synthesis on the negative strand of PT-NANBH 
genomic BHA/DNA and introduces an EcoRl site for cloning 



23 

CTTGAATTCT GGGAGGGCGT CTT 
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SEQ ID NO: 17 

SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 29 BASES 

STRANDEDNESS : single 

TOPOLOGY: linear 

HOLECULE TYPE: synthetic DNA 

OEaGINAL SOURCE ORGANISM: human; serum infectious for 
post*transfusional non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE: oligonucleotide synthesiser; oligo d92 
FEATURES: 

from 1 to 29 bases homologous to bases 36 to 64 of the negative strand 
of JG2 (SEQ ID NO : 3); bases 57 » 58 and 60 changed to introduce an - 
EcoRl recogaition site 

from 5 to 10 bases EcoRl recognition site 

PROPERTIES: primes DNA synthesis on the positive strand of PT-NANBH 
genomic RNA/DNA and introduces an EcoRl site for cloning 

CGCCGAATTC ATGCCGCCAC AGGAGGTTG 
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SEQ ID N0:18 

SEQUENCE TYPE: Nucleotide with corresponding protein 
SEQUENCE LENGTH: 504 BASE PAIRS 

STRANOEDNES S : a ingle 
TOPOLOGY: linear 

MOLECULE TYPE:cDNA to genomic RNA 

ORIGINAL SOURCE ORGANISM: human; serum ii 
post-traijsfusional non-A, non-B hepatitis 
IMMEDIATE EXPERIMENTAL SOURCE: clone 164/137 

FEATURES: 

from 308 to 504 bp start of the PT-NANBH polyprotein 
PROPERTIES rprobably encodes viral structural proteins 



GATCACTCCC CTGTGAGGAA CTACT6TCTT GACGCAGAA& GCGTCTAGCC ATGGCGTTAG 60 

TATGAGTGTC GTGCAGCCTC CAGGACCCCC CCTCCCGGGA 6AGCCATAGT GGTCTGCGGA 120 

AeCGGTGAGT ACACCGGAAT TGCCAGGACG ACCGGGTCCT TTCTTGGATT AACCCGCTCA 180 

ATGCCT6GAG ATTTGGGCGT GCCCCCGCAA OACTGCTAGC CGAGTAGTGT TGGGTCGCGA 240 

AAGGCCTTGT GGTACTGCCT 6ATAGG6TGC TTGCGAGTGC CCCGGGAGGT CTCGTAGACC 300 

GTGCACC ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC 349 
Met Set Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn 
5 10 

ACC AAC CGC CGC CCA CAG GAC GTC AAG TTC CCO GGC GGT GOT CAG ATC 397 
Thr Asn Fro Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He 
15 20 25 30 

err GGT GGA GTT TAG CTG TTG CCG CGC AGG GGC CCC ,AGG TTG GGT GTG 445 
Val Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 
35 40 ^5 
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CGC GCG ACT AGG AAG ACT TCC GAG CGG TCG CAA COT COT GGA AGG CGA 
Arg Ala Thr Arg Lys Tbr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg 
50 35 60 

CAA OCT ATC CC ■ 
Gin Pro lie Pro 
65 
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SEQ ID NO: 19 

SEQUENCE. TYPE :Nucleotide with corresponding protein 
SEQUENCE LENGTH: 1107 BASE PAIRS 

STRANDEDNES S : single 
XOPOLO0Y: linear 

MOLECULE TYPE:cDHA to genomic RNA 

ORIGINAL SOURCE ORGANISMrhuman; serum infectious 
post-transfusional non-A, non-B hepatitis 
IMMEDIATE EXPERIMENTAL SOURCE:oione 136/155 

FEATURES: 

from 1 to 1107 bp portion of the PT-NANBH polyprotein 
PROPERTIES: probably encodes viral structural proteins 



TCC TCC CGC TGC TGG GIA GCG CTC ACT CCC ACG CTC 6CG GCC AAG GAG 
Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Lys Asp 
5 10 15 



48 



GCC AGC ATC CCC ACT GCG ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT 
Ala Ser He Pro Thr Ala Thr He Arg Arg His Val Asp Leu Leu Val 
20 25 30 



96 



GGG GCG GOT GCC TTC TGC TCC GOT ATG TAG GTG GGG GAT CTC TGC GGA 
Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu Cys Gly 
35 40 45 



144 



TCT GTT TTC CTC GTC TCT CAG CTG TTC ACC TTC TCG OCT CGC CGA CAT 
Ser Val Phe Leu Val Sex Gin Leu Phe Thr Phe Ser Pro Arg Arg His 
50 55 60 



192 
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GAG ACG GTA GAG GAG TGG AAT TGT TC& ATC TAT CCC GGC CAC GTA TCA 
Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His Val Ser 
65 70 75 80 

GGT CAC CGC ATG OCT TGG GAT ATC ATO AIG AAC TGG TCA CCT ACA GCA 
Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Ala 
85 90 95 

GCC CTA GTG GTA TCG GAG CTA CTC CGG ATC CCA CAA GCT GTC GTG GAC 
Ala Leu Val Val Ser Gin Leu Leu Arg lie Pro Gin Ala Val Val Asp 
100 105 

ATG GTG GOG GGG GCC CAC TGG GGA GTC CTG GCG GGC GTT GGC TAG TAT 
Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
115 120 125 

TCC ATG GTG GGG AAC TGG GCT AAG GTC TTG GTT GTG ATG CTA CTC TTT 
Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
130 135 1*0 

GCC GGC GTT GAC GGG GAA CCT TAC ACG ACA 66G GGG ACA CAC GGC CGC 
Ala Gly Val Asp Gly Glu Pro Tyr Thr Thr Gly Gly Thr His Gly Arg 
145 150 155 

GCC GCC CAC GGG CTT ACA TCC CTC TTC ACA CCT GGG CCG GCT CAG AAA 
Ala Ala His Gly Leu Thr Ser Leu Phe Thr Pro Gly Pro Ala Gin Lys 
165 170 

ATC CAG CTT GTA AAC ACC AAC GGC AGC TGG CAC ATC AAC AGA ACT GCC 
He Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala 
180 185 

TTG AAC TGG AAT GAC TCC CTC CAA ACT GGG TTC CTT GCC GCG CTG TTC 
Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Leu Ala Ala Leu Phe 
195 200 205 



240 



286 



336 



384 



432 



480 



528 



576 



624 
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TAG ACG CkC AGG TTC AAT GCG TCC GGA T6C TCA GAG CGC ATG GCG AGO 
Tyr Thr His Arg Phe Asn Ala Ser Gly Cys Ser Glu Arg Het Ala Ser 
210 215 220 



672 



TGC CGC GCG ATT GAG GAG TTC GAT CAG GGG TGG GOT CGC ATG ACT TAT 
Cys Arg Pro lie Asp Gin Phe Asp Gin Gly Trp Gly Pro He Thr Tyr 
225 230 235 240 

AAT GAG TCC CAC GGC TTG GAG CAG AGG CGC TAT TGC TGG CAC TAG GCA 
Asn Glu Ser His Gly Leu Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 
2A5 250 255 



720 



768 



GOT CAA GCG TOT GGT ATG GTG GCG GCG TTG CAG GTG TGT GGC CCA GTG 
Pro Gin Pro Cys Gly He Val Pro Ala Leu Gin Val Cys Gly Pro Val 
260 265 270 

TAG TGT TTC ACT CCA AGC CCT GTT GTG GTG GGG ACG ACG GAT CGT TTC 
Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
275 280 285 

GGC GCC CCT ACG TAG A6A TGG GGT GAG AAT GAG ACG GAC GTG CTG CTT 
Gly Ala Pro Thr Tyr Arg Trp Gly Glu Asn Glu Thr Asp Val Leu Leu 
290 295 300 

CTC AAC AAC ACG GGG CCG CCA GGG GGC AAC TGG TTC GGC TGT ACA TGG 
Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
305 310 315 320 

ATG AAT AGC ACC GGG TTC ACC AAG AGG TGT GGG CGC CCG CCG TGC AAC 
Met Asn Ser Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
325 330 335 



816 



864 



912 



960 



1008 



ATC GGG GGG GTG GGC AAC AAC ACT TTG ATG TGC CGC ACG GAC TGC TTG 
He Gly Gly Val Gly Asn Asn Thr Leu He Cys Pro Thr Asp Cys Phe 
340 345 350 



1056 
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CGG AAG CAT CCC GAG GCC ACT TAG AGO AAA TGC GGT TCG GGG COT TGG 1104 
Arg Lys His Pto Glu Ala Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp 
355 560 365 



TTG 

1107 

Leu 
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SEQ ID NO: 20 

SEQUENCE TYPE '.Nucleotide with corresponding protein 
SEQUENCE LENGTH; 2043 BASE PAIRS 

STRANDEDNESS : s ingle 
TOPOLOGY: linear 

HOLECOLE TYPErcDNA. to genomic RNA 

ORIGINAL SOURCE ORGANISM: human; serum infect 
post-transfuslonal non-A, non-B hepatitis 
IMMEDIATE EXPERIMENTAL SOURCE : clone 156/92 

FEATURES: 

from 1 to 2043 bp portion of the PT-NANBH polyprotein 
PROPERTIES: probably encodes viral non- structural proteins 



TOG GAG GGC GTC TTC ACA GGC CTC AGC CAC GTG GAT GCC CAC TTC CTG 
Trp Glu.Gly Val Phe Thr Gly Leu Thr His Val Asp Ala His Phe Leu 
5 10 15 



48 



TCC CAA ACA AAG CAG GCA GGA GAC AAC TTC CCC TAC CTG GTG GCG TAG 
Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr Leu Val Ala Tyr 
20 25 30 



96 



CAG GCT ACT GTG TGC GOT AGG GCC CAG GCC CCA CCT CCA TCA TGG GAT 
Gin Ala Thr Val Gys Ala Arg Ala Gin Ala Pro Pro Pro Ser Trp Asp 
33 40 *5 



144 



CAA ATG TGG AAG TGT CTC ATA CGG CTA AAG CCT ACT CTG GGC GGG CCA 
Gin Met Trp Lys Cys Leu He Arg Leu Lys Pro Thr Leu Arg Gly Pro 
50 55 ^° 



192 
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ACA CCC TIG CTG TAT AGG CTG GGA GCC GTC CAA AAC GAG GTC ACC CTC 240 
Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asu Glu Val Thr Leu 
65 70 75 80 



ACA CAC CCC ATA ACC AAA TTC ATC ATG GCA TGC ATG TCA GCC GAC CTG 
Thr His Pro He Thr Lys Phe He Met Ala Cys Met Ser Ala Asp Leu 
85 90 95 



288 



GAG GTC GTC ACG AGO ACC TGG GTG CTG GTG GGC GGG GTC CTT GCA GCT 
Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala 
100 105 110 



336 



CTG GCT GCG TAT TGC TTG ACA ACA GGC AGC GTG GTC ATT GTG GGT AGG 
Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val He Val Gly Arg 
115 120 125 



384 



ATC ATC TTG TGC GGG GGG CCG GCT ATT GTT CCC GAG AGG GAA GTC CTC 
He He Leu Ser Gly Arg Pro Ala He Val Pro Asp Arg Glu Val Leu 
130 135 140 



432 



TAG CAG GAG TTC GAT GAG ATG GAA GAG TGC GCG TGG CAC CTC OCT TAG 
Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr 
145 150 155 160 



480 



ATC GAG CAG GGA ATG CAG CTC GCC GAG CAG TTC AAG CAA AAA GCG CTC 
He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys Gin Lys Ala Leu 
165 170 175 



528 



GGG TTG CTG CAG ACA GCC ACC AAG CAA GCG GAG GCC GCT GCT CCC GTG 
Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala Ala Ala Pro Val 
180 185 190 



576 



GTG GAG TGC AAG TGG CGA GCC CTT GAG ACC TTC TGG GCG AAA CAC ATG 
Val Glu Ser Lys Trp Arg Ala Leu Glu Thr Phe Trp Ala Lys His Met 
195 200 205 



624 
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TGG AAC TTC ATC AGO GGG ATA CAG TAG TTA GCA GGC TTG TCC ACT CTG 
Trp Asn Phe lie Ser Gly lie Gin Tyr Lau Ala Gly Leu Ser Thr Leu 
210 215 220 



672 



OCT GGG AAT CCG GGG ATT GCA TCA CTG ATG GCG TTC ACA GCC TOT GTC 
Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala Phe Thr Ala Ser Val 
225 230 235 240 

ACT AGC CCG CTC ACC ACC CAA TCT ACC CTC CTG CTT AAC ATC CTG GGG 
Thr Ser Pro Leu Thr Thr Gin Ser Thr Leu Leu Leu Asn lie Leu Gly 
245 250 255 



720 



768 



GGA TGG GTA GCC GCC CAA CTC GCT CCC CCG ACT GCT GCT TCA GOT TTC 
Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala Ala Ser Ala Phe 
260 265 270 



816 



GTA GGC GCC GGC ATT GCT GGT GCG GCT GTT GGC AGC ATA GGC CTT GGG 
Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser lie Gly Leu Gly 
275 280 285 



864 



AAG GTG CTT GTG GAC ATC TTG GCG GGC TAT GGA GCA GGA GTG GCA GGC 
Lys VaL Leu Val Aap lie Uu Ala Gly Tyr Gly AU Gly Val Ala Gly 
290 295 300 

GCG CTC GTG GCC TTT AAG GTC ATG AGC GGC GAA ATG CCC TCC ACC GAG 
Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met Pro Ser Thr Glu 
305 310 315 320 

GAC CTG GTT AAC TTA CTC CCT GCC ATC CTC TCT CCT GGT GCC CTG GTC 
Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro Gly Ala Leu Val 
325 330 335 



912 



960 



1008 



GTC GGG GTC GTG TGC GCA GCG ATA CTG CGT CGG CAG GTG GGT CCA GGG 
Val Gly Val Val Cys Ala Ala He Leu Arg Arg His Val Gly Pro Gly 
340 345 350 



1056 
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GAG GGG GCT GTG CAG TGG ATG AAC CGG CTG ATA GCG TTC GCC TOG CGG 1104 
Glu Gly Ala Val Gin Trp Met Asn Arg Leu lie Ala Phe Ala Ser Arg 
355 360 365 

GGT AAC CAT GTT TCC CCC ACG CAC TAT GTG CCA GAG AGO GAC GCC GCA 1152 
Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala 
370 375 380 

GCA GGT GTC ACT CAG ATC CTC TCC GAC CTT ACT ATC ACC CAA CTG TTG 1200 
Ala Arg Val Thr Gin lie Leu Ser Asp Leu Thr He Thr Gin Leu Leu 
385 390 395 400 

AAG AGG CTC CAC CAG TGG ATT AAC GAG GAC TGC TCC ACG CCC TGC TCC 1248 
Lys Arg Leu His Gin Trp lie Asn Glu Asp Cys Ser Thr Pro Cys Ser 
405 410 415 

GGC TCG TGG OTA AGG GAT GTT TGG GAC TGG ATA TGC ACA GTT TTG GCT 1296 
Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu Ala 
420 425 430 

GAC TTC AAG ACC TGG CTC CAG TCC AAG CTC CTG CCG CGA TTA CCG GGA 1344 
Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro Gly 
435 440 445 

GTC CCC TTT TTC TCA TGC CAA CGT GGG TAG AAG GGG GTC TGG CGG GGA 1392 
Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg Gly 
450 455 460 

GAC GGC ATC ATG CAG ACC ACC TGC TCA TGT GGA GCA CAG ATC ACC GGA 1440 
Asp Gly He Met Gin Thr Thr Cys Ser Cys Gly Ala Gin He Thr Gly 
465 470 475 ^80 

CAT GTC AAA AAC GGT TCC ATG AGG ATC GTT GGG CCT AAG ACC TGT AGT 1488 
His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Lys Thr Cys Ser 
485 490 495 
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AAC ATG TGG CAT GGA ACA TTC CCC ATC AAC GCA TAG AGO ACG GGC CCG 
Ash Met Trp His Gly Thr Phe Pro lie Asti Ala Tyr Thr Thr Gly Pro 
500 505 510 



153€ 



TGC ACG CCC TCC CCA GCG CCA AAC TAT TCC AGG GCG CTG TGG CGG GTG 
Cys Thr Pro Ser Pro Ala Pro Asti Tyr Ser Arg Ala Leu Trp Arg Val 
515 520 525 



1S8A 



GCT GCT GAG GAG TAG GTG GAG GTT AGG CGG GTG GGG GAT TTC CAC TAG 
Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr 
530 535 540 



1632 



GTG ACG AGC ATG ACC ACT GAC AAC GTA AAA TGC CCG TGC GAG GTT CCA 
Val Thr Ser Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val Pro 
545 550 555 560 

GCG CCC GAA TTC TTC ACA GAA GTG GAT GGG GTG CGG CTG CAC AGG TAG 
Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg Tyr 
565 570 575 



1680 



1728 



GCT CCG GCG TGC AAA CCT CTC CTA CGG GAG GAG GTC ACA TTC CAG GTC 
Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Gin Val 
580 585 590 



1776 



GGG CTC AAC CAA TAC CTG GTT GGG TCG CAG CTC CCA TGC GAG CCC GAA 1824 
Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro Glu 
595 600 605 



CCG GAT GTA GCA GTG CTC ACT TCC ATG CTC ACC GAC CCC TCC CAC ATC 
Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His He 
610 615 620 



1872 



ACA GCA GAG ACG GCT AAG CGC AGG CTG GCC AGG GGG TCT CCC CCC TCC 1920 
Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser 
625 630 635 640 
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TTG GCC AGC TCT TCA GCT AGO CAG TIG TCT GCG CCT tCC TCG AAG GCC 1968 
Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Ser Lys Ala 
645 650 655 

ACA TAG ATT ACC CAA AAT GAC TTC CGA GAG GCT PAC CTC ATC GAG GCC 2016 
Thr Tyr He Thr Gin Asn Asp Phe Pro Asp Ala Asp Leu He Glu Ala 
660 665 670 



AAC CTC CTG TGG CGG CAT GAG ATG GGC 
Asn Leu Leu Trp Arg His Glu Ket Gly 
675 680 



2043 
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SEQ ID NO: 21 

SEQUENCE TYPE:NviclBotide with corresponding protein 
SEQTJENCE LENGTH: 2116 BASE PAIRS 

STRANDEDNESS : single 
TOPOLOGY: linear 

KOLECULE TYPE:cDHA to genomic RNA 

ORIGINAL SOURCE ORGAI?ISH:huinan; serum infectious for 
post-transfusionfll non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOURCE rcontig formed by cDNA clones from 5' end 
of the genome 

FEATURES: 

from 308 to 2116 bp start of the PT-RANBH polyprotein 
PROPERTIES: viral structural and non- structural proteins 



GATCACTCCC CTGTGAGGAA CTAGIGTCTT CACGCAGAAA GCGTCTAGCC ATGGCGTTAG 60 

TATGAGTGTC GTGCAGCCTC CAGGACCCCC CCTCCCGGGA GAGGCATAGT GGTCTGCGGA 120 

ACCGGTGAGT ACACCGGAAT TGCCAGGACG ACCGGGTCCT TrCTTGGATT AACCCGGTCA 180 

ATGCCTGGAG ATTTGGGCGT GCCCCCGCAA GACTGCTAGC CGAGTAGTGT TGGGTCGCGA 240 

AAGGCCTTGT GGTACTGCCT GATAGGGTGC TTGCGAGTGC CCCGGGAGGT CTCGTAGACC 300 

GTGCACC ATG AGO ACG AAT OCT AAA COT CAA AGA AAA AGO AAA CGT AAC 349 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn 
5 10 



AGO AAG CGC CGC CCA GAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC 
Thr Asn Pro Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie 
15 20 



25 30 



GTT GGT GGA GTT TAG CTG TIG CCG CGC AGG GGC CCG AGG TTG GGT GTG 
Val Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 
35 40 ^5 



397 



445 
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CGC GCG ACT AGG AAG ACT TCC GAG CGG ICG CAA OCT CGT GGA AGG CGA 
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg 
50 55 60 



493 



CAA CCT ATC CCC AAG GCT CGC CAO CCC GAG GGC AGG GCC TGG GCT CAG 
Gin Pro lie Pro Lys Ala Arg Gin Pro Glu Gly Arg Ala Trp Ala Gin 
65 70 75 



541 



CCC GGG TAG CCT TGG CCC CTC TAT GGC AAC GAG GGC ATG GGG TGG GCA 
Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala 
80 85 90 



589 



GGA TGG CTC CTG TCA CCC CGT GGC TCC CGG CCT AGT TGG GGC CCC ACT 
Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 
100 105 110 115 



637 



GAC CCC CGG CGT AGG TCG CGT AAT TTG GGT AAA GTC ATC GAT ACC CTC 
Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu 
120 125 130 



685 



ACA TGC GGC TTC GCC GAC CTC ATG GGG TAC ATT CCG CTC GTC GGC GCT 
Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
135 140 145 



733 



CCC TTA GGG GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG 
Pro Leu Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
150 155 160 



781 



GAG GAC GGC GTG AAC lAT GCA ACA GGG AAT TTA CCC GGT TGC TCT TTC 
Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys- Ser Phe 
165 170 175 



829 



TCT ATC TTC CTC TTG GCT TTG CTG TCC TGT TTG ACC ATT CCA GCT TCC 
Ser Ho Phe Leu Leu Ala Leu Uu Ser Cys Leu Thr He Pro Ala Ser 
180 185 190 195 



877 
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GOT TAT GAA GTG CGC MC GTG TCC GGG ATC TAG CAT GTC ACG AAC GAT 925 
Ala Tyr Glu Val Arg Asti Val Ser Gly lie Tyr His Val Thr Asn Asp 
200 205 210 



TGC TCC AAC TCA AGC ATC GTG TAG GAG ACA GCG GAG ATG ATC ATG CAC 
Cys Ser Asn Ser Ser lie Val Tyr' Glu Thr Ala Asp Met He Met His 
215 220 225 

ACC CCC GGG TGT GTG CCC TGT GTC CGG GAG GGT AAT TCC TCC CGC TGC 
Thr Pro Gly Cya Val Pro Cys Val Arg Glu Gly Asn Ser Ser Arg Cys 
230 235 240 

TGG GTA GCG GTC ACT CCC ACG CTC GCG GCC AAG GAG GCC AGC ATC CCC 
Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Lys Asp Ala Ser lie Pre 
245 250 255 

ACT GCG ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCC 
Thr Ala Thr lie Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala 
260 265 270 275 

TTC TGC TCC GCT ATG TAG GTG GGG GAT CTC TGC GGA TCT GTT TTC CTC 
Phe Cys Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu 
280 285 290 

GTC TCT CAG CTG TTC ACC TTC TCG GCT CGC CGA CAT CAG ACG GTA CAG 
Val Ser Gin Leu Phe Thr Phe Ser Pro Arg Arg His Gin Thr Val Gin 
295 300 

GAC TGC AAT TGT TCA ATC TAT CCC GGC CAC GTA TCA GGT CAC CGC ATG 
Asp Cys Asn Cys Ser He Tyr Pro Gly His Val Ser Gly His Arg Met 
310 315 320 



973 



1021 



1069 



1117 



1165 



1213 



1261 



GCT TGG GAT ATG ATG ATG AAC TGG TCA CCT ACA CCA GCC CTA GTG GTA 1309 
Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val 
325 330 335 



liJS/AC/12th December 1990 



- 86 - 



PA1121 



TC6 CA6 CTA CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG 1357 
Ser Gin Leu Leu Arg lie Pro Gin Ala Val Val Asp Met Val Ala Gly 
340 345 350 355 

GCC CAC TGG GGA GTC CIG GCG GGC CTT GCC TAG TAT TCC ATG GTG GGG 1405 
Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly 
360 365 370 

AAC TGG GCT AAG GTC TTG GTT GTG ATG CTA CTC TXT GCC GGC GTT GAC 1453 
Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp 
375 380 385 

GGG GAA CCT TAC ACG ACA GGG GGG AGA CAC GGC CGC GCC GCC CAC GGG 1501 
Gly Glu Pro Tyr Thr Thr Gly Gly Thr His Gly Arg Ala Ala His Gly 
390 395 400 

CTT ACA TCC GTC TTC ACA CCT GGG CCG GCT CAO AAA ATC CAG CTT GTA 15A9 
Leu Thr Ser Leu Phe Thr Pro Gly Pro Ala Gin Lys lie Gin Leu VaL 
405 • 410 415 

AAC ACC AAC GGC AGC TGG CAC ATC AAC AGA ACT GCC TTG AAC TGC AAT 1597 
Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn 
420 425 430 435 

GAC TCC CTC CAA ACT GGG TTC CTT GCC GCG CTG TTC TAC ACG CAC AGG 1645 
Asp Ser Leu Gin Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Arg 
440 445 450 

TTC AAT GCG TCC GGA TGC TCA GAG CGC ATG GCC AGC TGC CGC CCC ATT 1693 
Phe Asn Ala Ser Gly Cys Ser Glu Arg Met Ala Ser Cys Arg Pro lie 
455 460 465 

GAC CAG TTC GAT CAG GGG TGG GGT CCC ATC ACT TAT AAT GAG TCC CAC 1741. 
Asp Gin Phe Asp Gin Gly Trp Gly Pro He Thr Tyr Asn Glu Ser His 
470 475 480 
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GGC TTG GAC CAG AGG CCC TAT TGC TGG CAC TAG GCA CCT CAA COG TGT 
Gly Leu Asp Gin Arg Pro Tyr Cys Trp His Tyt Ala Pro Gin Pro Cys 
485 490 495 

GGT ATC GTG CCC GCG TTG CAG GTG TGT GGC CCA GTG TAG TGT TTC ACT 
Gly He Val Pro Ala Leu Gin Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 515 

CCA AGC CCT GTT GTG GTG GGG ACG ACC GAT C6T TTC GGC GCC CCT ACG 
Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr 
520 525 530 

TAG AGA TGG GGT GAG AAT GAG ACG GAC GTG CTG CTT CTC AAC AAC ACG 
Tyr Arg Trp Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr 
535 540 545 

CGG CCG CCA CGG GGC AAC TGG TTG GGC TGT ACA TGG ATG AAT AGC ACC 
Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr 
550 ■ 555 560 

GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GTG 
Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Val 
565 570 575 

GGC AAC AAC ACT TTG ATC TGC CCC ACG GAC TGC TTC CGG AAG CAT CCC 
Gly Asn Asn Thr Leu He Cys Pro Thr Asp Cys Phe Arg Lys His Pro 
580 585 590 595 

GAG GCC ACT TAG ACC AAA TGC GGT TCG GGG CCT TGG TTG 
Glu Ala Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu 
600 605 



1789 



1837 



1885 



1933 



1981 



2029 



2077 



2116 
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SEQ ID HO: 22 

SEQUENCE TYPE: Nucleotide with corresponding protein 
SEQUENCE LENGTH; 3750 BASE PAIRS 

smilDEDNESS : single 
TOPOLOGY: linear 

IK>LECUI£ TYPE:cDNA to genomic RNA 

ORIGINAL SOURCE ORGflHISM:htxman; serum infectious 
post-transfustonal non-A, non-B hepatitis 

IMMEDIATE EXPERIMENTAL SOIJRCE:contig formed by cDNA clones from 3' 
of tbe genome 

FEATURES: 

from 1 to 3750 bp portion of the PT-NANBH polyprotein 
PROPERTIES: viral non-struotxiral proteins 



TGG GAG GGC GTC TTC ACA GGC CTC AGO CAC GTG GAT GCC CAC TTC CTG 
Trp Glu Gly Val Phe Thr Gly Leu Thr His Val Asp Ala His Phe Leu 
5 10 15 



A3 



TCC CAA ACA AAG CAG GCA GGA GAG AAC TTC CCC TAG CTG GTG GCG TAG 
Ser Oln Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr Leu Val Ala Tyr 
20 25 30 



96 



CAG GOT ACT GTG TGC GCT AGG GCC CAG GCC CCA CCT CCA TCA TGG GAT 
Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro Pro Ser Trp Asp 
35 40 45 



144 



CAA ATG TGG AAG TGT CTC ATA CGG CTA AAG CCT ACT GTG CGC GGG CCA 
Gin Met Trp Lys Cys Leu He Arg Leu Lys Pro Thr Leu Arg Gly Pro 
50 55 60 



192 
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ACA CCC TTG CTG TAT AGG CTO G6A GGC GTC CAA AAC GAG GTC AGO CTC 
Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asn Glu Val Thr Leu 
65 70 75 80 



240 



ACA CAC CCC ATA ACC AAA TTC ATC ATG GCA TGC ATG TCA GCC GAC CTG 
Thr His Pro lie Thr Lys Phe lie Met Ala Cys Met Ser Ala Aap Leu 
8S 90 95 



288 



GAG GTC GTC ACG AGC ACC TGG GTG CTC GTG GGC GGG GTC CTT GCA GCT 
Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala 
100 105 110 

CTG GCT 6CG TAT TGC TTG ACA ACA GGC AGC GTG GTC ATT GTG GGT AGG 
Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val lie Val Gly Arg 
115 120 125 

ATC ATC TTG TCC GGG CGG CCG GCT ATT GTT CCC GAC AGG GAA GTC CTC 
He He Leu Ser Gly Arg Pro Ala He Val Pro Asp Arg Glu Val Leu 
130 ■ 135 1*0 



336 



384 



432 



TAG GAG GAG TTC GAT GAG ATG GAA GAG TGC GCG TCG CAC CTC CCT TAG 
Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr 
145 150 155 160 

ATC GAG CAO GGA ATG CAG CTC GCC GAG CAG TTC AAG CAA AAA GCG CTC 
He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys Gin Lys Ala Leu 
165 170 175 



480 



528 



GGG TTG CTG CAG ACA GCC ACC AAG CAA GCG GAG GCC GCT GCT CCC GTG 
Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala Ala Ala Pro Val 
180 185 190 



576 



GTG GAG TCC AAG TGG GGA GCC CTT GAG ACC TTC TGG GCG AAA CAC ATG 
Val Glu Ser Lys Trp Arg Ala Leu Glu Thr Phe Trp Ala Lys His Met 
195 200 205 



624 
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TGG AAC TTC ATC AGC GGG A,TA GAG TAG TTA GCA GGC TTG TCC ACT CIG 
Trp Asn Phe lie Ser Gly He GZsi Tyr Leu Ala Gly Leu Ser Thr Leu 
210 215 220 

COT GGG AAT CCC GC6 ATT GCA TCA CTG ATG GCG TTC ACA GCC TCT GTC 
Pro Gly Asn Pro Ala lie Ala Ser Leu Het Ala Phe Tbr Ala Ser Val 
225 230 235 240 

ACT AGC GCG CTC ACC ACC CAA TCT ACC CTC CTG CTT AAC ATC CTG GGG 
Thr Ser Pro Leu Thr Thr Gin Ser Thr Leu Leu Leu Asn He Leu Gly 
245 250 255 



672 



720 



768 



GGA TGG GTA GCC GCC CAA CTG GCT CCC CCC ACT GCT GOT TCA GCT TTC 
Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala Ala Ser Ala Phe 
260 265 270 

GTA GGG GCC GGC ATT GCT GGT GCG GCT GTT GGC AGC ATA GGC CTT GGG 
Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser He Gly Leu Gly 

275 280 285 

AAG GTG CTT GTG GAC ATC TTG GCG GGC TAT GGA GCA GGA GTG GCA GGC 
Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly 
290 295 300 

GCG CTC GTG GCC TTT AAG GTC ATG AGC GGC GAA ATG CCC TCC ACC GAG 
Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met Pro Ser Thr Glu 
305 310 315 320 

GAC CTG GTT AAC TTA CTC GCT GCC ATC CTC TCT CCT GGT GCC CTG GTC 
Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro Gly Ala Leu Val 
325 330 335 



816 



864 



912 



960 



1008 



GTC GGG GTC GTG TGG OCA GCG ATA CTG CGT CGG CAC GTG GGT CCA GGG 1036 
Val Gly Val Val Cys Ala Ala He Leu Arg Arg His Val Gly Pro Gly 
340 345 350 



llJS/AC/12th December 1990 



91 - PA.1121 



GAG GGO GCT GTG CAG TGG ATG AAC CGG CTG ATA GCC TTC CCC TCG CGG 1104 
Clu Gly Ala Val Gin Ttp Met Asn Arg Leu He Ala Phe Ala Ser Arg 
355 360 365 

GGT AAC CAI GTT TCG CCC ACG CAC TAT GTG CCA GAG AGO GAC GCC GCA U52 
Gly Asn His Val Ser Pro Thr His Tyr Val Pro Clu Ser Asp Ala Ala 
370 375 380 

GCA CGT GTC ACT CAG ATC CTC TCC GAC CTT ACT ATC ACC CAA CTG TTG 1200 
Ala Arg Val Thr Glt> He Leu Ser Asp Leu Thr lie Thr Gin Leu Leu 
385 390 395 ^00 

AAG AGG CTC GAC CAG TGG ATT AAC GAG GAC TGG TCC ACG CCC TGC TCC 12A8 
Lys Arg Leu His Glt> Trp He Asn Glu Asp Cys Ser Thr Pro Cya Ser 
405 



415 



GGC TCG TGG CTA AGG GAT GTT TGG GAC TGG ATA TGC ACA GTT TTG GCT 
Gly ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu Ala 
420 - «5 A30 

GAC TTC AAG ACC TGG CTC CAG TCC AAG CTC CTG CGG CGA TTA CCG GGA 
Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Fro Arg l^u Pro Gly 
435 440 445 

CTC CCC.TTT TTC TCA TGC CAA CGT CGG TAG AAG CGG GTC TGG CGG GGA 
Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg Gly 
450 455 

GAC GGC ATC ATG CAG ACC ACC TGC TCA TGT GCA GCA CAG ATC ACC GGA 
Asp Gly He Met Gin Thr Thr Cys Ser Cys Gly Ala Gin lie Thr Gly 
465 470 ^75 

CAT GTC AAA AAC GGT TCC ATG AGG ATC GTT GGG OCT AAG ACC TGT ACT 
His Val Lys Asn Gly Ser Met Arg lie Val Gly Pro Lys Thr Cys Ser 
485 490 ^55 



1296 



1344 



1392 



1440 



1488 
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AAC AIG TGG CAT GGA ACA TTC CGC ATG AAC GCA TAG ACC ACG GGC COG 1536 
Asn Met Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Cly Pro 
500 505 510 

TGG ACG CCC TCC CCA GGG CCA AAC TAT TCC AGG 6C6 CTG TGG GGG GTG 1584 
Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg Val 
515 520 525 

GCT GCT GAG GAG TAC GTG GAG GTT ACG CGG GTG GGG GAI TTC CAC TAG 1632 
Ala Ala Clu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr 
530 535 540 

GTG ACG AGG ATG ACC ACT GAC AAC GTA AAA TGC CCG TGC GAG GTT CCA 1680 
Val Thr Ser Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val Pro 
545 550 555 560 

GGC CCC GAA TTC TTC ACA GAA GTG GAT GGG GTG CGG CTG CAC AGG TAC 1728 
Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg Tyr 
565 570 575 

GCT CCG GCG TGC AAA CCT CTG CTA CGG GAG GAG GTC ACA TTC GAG GTC 1776 
Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Gin Val 
580 585 590 

GGG CTC AAC CAA TAC CTG GTT GGG TCG GAG GTC CCA TGC GAG CCC GAA 1824 
Gly Leu' Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro Glu 
395 600 605 

CCG GAT GTA GCA GTG CTC ACT TCG ATG CTC ACC GAC CCC TCC CAC ATC 1872 
Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His He 
610 615 620 

ACA GCA GAG ACG GCT AAG CGC AGG CTG GCC AGG GGG TCT CCC CCC TCC 1920 
Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser 
625 630 635 640 
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TTG GCC AGC TOT TCA GOT A6C GAG TTG TCT GCG OCT TCC TOG AAG GCG 1968 
Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Ser Lys Ala 
645 650 655 

ACA TAG ATT ACC CAA AAT GAG TTC CCA GAG GOT GAG CTC ATG GAG GCC 2016 
Thr Tyt lie Thr Gin Asn Asp Phe Pro Asp Ala Asp Leu lie Glu Ala 
660 665 670 

AAG CTC CTG TGG CGG CAT GAG ATG GGC GGG GAG ATT ACC CGC GTG GAG 2064 
Asn Leu Leu Trp Arg His Glu Met Gly Gly Asp Ue Thr Arg Val Glu 
675 680 685 



TCA GAG AAG AAG GTA GTA ATG CTG GAC TCT TTC GAC CGG CTC CGA GCG 
Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Asp Pro Leu Arg Ala 
690 695 700 

GAG GAG GAT GAG CGG GAA GTG TCC GTC CCG GCG GAG ATG CTG GGG AAA 
Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg Lys 
705 710 715 720 

TGG AAG AAA TTC CCA CCA GCG ATG GCC GCA TGG GCA CGC CCG GAT TAG 
Ser Lys Lys Phe Pro Pro Ala Met Pro Ala Trp Ala Arg Pro Asp Tyr 
725 730 735 

AAG CCT CCG CTG CTG GAG TCC TGG AAG GCC CCG GAC TAG GTC CCT CCA 
Asn Pro Pro Uu Leu Glu Ser Trp Lys Ala Pro Asp Tyr Val Pro Pro 
740 745 750 



2112 



2160 



2208 



2256 



GTG GTA CAT GGG TGG CCA CTG GCA CCT ACT AAG ACC CCT CCT ATA CCA 
Val Val His Gly Cys Pro Leu Pro Pro Thr Lys Thr Pro Pro He Pro 
755 760 765 

CCT CCA CGG AGG AAG AGG ACA GTT GTT CTG ACA GAA TCC ACC GTG TCT 
Pro Pro Arg Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser 
770 775 780 



2304 



2352 
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TCT GCC CTG GCG GAG CTT GCC ACA AAG GCT TTC GGT AGO TCC GAA CC6 2400 
Ser Ala Leu Ala Glu Leu Ala Thr Lys Ala Fhe Gly Ser Ser Glu Pro 
785 790 795 800 

TOG GCC GTC GAC AGC GGC ACQ GCA ACC GCC OCT CCT GAG CAA CCC TCC 2448 
Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Pro Pro Asp Gin Pro Ser 
805 810 815 

GAC GAC GGC 6GA GCA GGA TCT GAC GTT GAG TGG TAT TCC TCC ATG CCC 2496 
Asp Asp Gly Gly Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met Pro 
820 825 830 

CCC CTT GAG GGG GAG CCC GGG GAC CCC GAT CTC AGC GAC GGG TCT TGG 2544 
Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp 
835 840 845 

TCT ACC GTG AGT GAG GAG GCC GGT GAG GAC GTC GTC TGG TGC TCG ATG 2592 
Ser Thr Val Ser Glu Glu Ala Gly Glu Asp Val Val Cys Cys Ser Met 
850 855 860 

TCC TAC ACA TGG ACA GGC GCT CTG ATG ACG CCA TGC GCT GCG GAG GAA 2640 
Ser Tyr Thr Trp Thr Gly Ala Leu Xle Thz Pro Cys Ala Ala Glu Glu 
865 870 875 880 

AGC AAG CTG CCC ATG AAC GCG TTG AGC AAC TCT TTG CTG GGT CAC CAC 2688 
Ser Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu Arg His His 
885 890 895 

AAC ATG GTC TAC GCT ACC ACA TCC CGC AGC GCA AGC CAG GGG CAG AAG 2736 
Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin Arg Gin Lys 
900 905 910 

AAG GTC ACC TTT GAC AGA CTG CAA ATC CTG GAC GAT CAC TAC CAG GAC 2784 
Lys Val Thr Phe Asp Arg Leu Gin He Leu Asp Asp His Tyr Gin Asp 
915 920 925 
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GTG CTC MG GAG ATG AAG GCG AAG GCG TCC AQA GTT AAG GOT AAG CTT 2832 
Val Leu Lys GXu Ket Lys Ala Lys Ala Ser Thr Val Lys Ala Lys Leu 
930 935 9*0 



CTA TCA GTA GAG GAA GCG TGC AAG GTG ACG CCC CCA CAT TCC GCC AAA 
Leu Set Val Glu GLu Ala Cys Lys Leu Thr Pro Pro His Ser Ala Lys 
945 950 935 960 

XCT AAA TTT GGC TAT GG6 GCA AAG GAG GTG CGG AAC CTA TCC AGC AAG 
Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Ser Ser Lys 
965 970 975 

GCC ATT AAC CAC ATC CGG TCC GTG TGG GAG GAC TTG TTG GAA GAC ACT 
Ala He Asn His He Arg Ser Val Trp Glu Asp Leu Leu Glu Asp Thr 
980 985 990 

GAA AGA CCA ATT GAC ACC ACC ATC ATG GCA AAA AAT GAG GTT TTC TGC 
GLu Thr Pro He Asp Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys 
995 1000 1005 

GTC CAA CCA GAG AGA GGA GGC CGC AAG CCA GCT CGC CTT ATC GTG TTC 
Val Gin Pro Glu Arg Gly Gly Arg Lys Pro Ala Arg Leu He Val Phe 
1010 1015 1020 

CCA GAC- TTG 6GG GTC CGT GTG TGC GAG AAA ATG GCC CTC TAT GAC GTG 
Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val 
1025 1030 1035 1040 

GTC TCC ACC CTC GCT CAG GCT GTG ATG GGC TCC TCG TAG GGA TTC GAG 
Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser Tyr Gly Phe Gin 
1045 1050 1055 

TAT TCT GCT GGA CAG CGG GTC GAG TTC CTG GTG AAC GCC TGG AAA TCA 
Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala Trp Lys Ser 
1060 1065 1070 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 
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AAG AAG ACC CCT ATG GGC TTT GCA TAT GAG AGO CGC TGI TTT GAC TCA 3264 
Lys Lys Thr Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys Phe Asp Ser 
1075 1080 1085 

AGA GTC ACT GAG &AT GAC ATG CGT GTA GAG GAG TCA ATT TAT CAA TGT 3312 
Thr Val Ihr Glu Asn Asp He Arg Val Glu Glu Ser He Tyr Gin Cys 
1090 1095 1100 

TGT GAC TTG GCC CGC GAA GGC AGA CAG GGC ATA AGG TCG CTC ACA GAG 3360 
Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser Leu Thr Glu 
1105 1110 1115 1120 

CGG CTT TAT ATC GGG GCT GCC CTG ACT AAT TCA AftA GGG CAG AAC TGC 3408 
Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly Gin Asn Cys 
1125 1130 1135 

GGC TAT CGC CGG TGC CGC GCG A6C GGC GTG CTG ACG ACT AGC TGC GGT 3456 
Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Tbr Thr Ser Cys Gly 
1140 1145 1150 

AAT ACC CTC ACA TGT TAC TTG MG GCC TCT GCA GCC TGT CGA GCT GCA 3504 
Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys Arg Ala Ala 
1155 1160 1165 

AAG CTC. CAG GAC TGC ACG ATG CTC GTG TGC GOA GAC GGC CTT GTC GTT 3552 
Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val 
1170 1175 1180 

ATC TGT GAG AGC GCG GGA ACC CAG GAG GAC GCG GCG AGC CTA CGA GTC 36O0 
He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu Arg Val 
1185 1190 1195 1200 

TTC ACG GAG GCT ATG ACT AGG TAG TCT GCC GCC GCC GGG GAC CCG CCC 3648 
Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro 
1205 1210 1215 
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CAA CCA GAA TAG GAC CTG GAG TTG ATA ACA TCA TGC TCC TCC AAT GTG 3696 
Gin Pro Glu Tyr Asp Leu Glu Leu lie Thr Ser Cys Ser Sar Aan Val 
1220 1225 1230 



TCG GTC GOG CAC GAT GCA TCT GGC AAA AGG GTA TAG TAG CTC ACC CGT 
Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg 
1235 1240 1245 



3744 



GAC CC6 
Asp Pro 
1250 



3750 



MJS/AC/12th December 1990 



- 98 - 



PA1121 



SEQ ID NO: 23 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 BASES 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DKA 

ORIGINAL SOURCE ORGANISM:baculo virus Autographa califomica Nuclear 
Polyhedrosis virus (AcNPV) 

XMKEDIATE EXPERIMENTAL SOURCE: Oligonucleotide synthesiser; oligo d2U 
FEATURES : 

from 1 to 23 bases homologous to portion of AcNPV polyhedrin gene 
downstream of the BaniHl cloning site in pAc360 and similar vectors 

PROPERTIES: primes DNA synthesis from baculovixus transfer vector 
seqixences which flank DNA inserted at the BamHl site. 



CGGGTTTAAC ATTACGGATT TCC 23 
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SEQ ID NO: 24 

SEQUENCE TYPE: Nucleotide 
SKJUENCE LENGTH: 31 BASES 

STRANDEDNESS : single 

XOPOLOCnf: linear 

HOLECDLE TYPE : synthetic DNA 

ORIGINAL SOURCE ORGANISM: baculovirus Autographa californica Nuclear 
Polyhedrosis virus (AcOTV) 

IMMEDIATE EXPERIMENTAL SOURCE: Oligonucleotide synthesiser; oligo dl26 
FEATURES: 

from 1 to 31 bases homologous to the upstream junction sequences 
produced vhen cDNA amplified by d75 (SEQ ID 5) is cloned into the 
BamHl cloning site in pAc360 and similar vectors; mismatches at bases 
13 and 14 introduce a Pstl site 

from 1 to 10 bases homologous to region of BamHl site in pAc360 and 

similar vectors 

from 4 to 9 bases BamHl site 

from 12 to 17 bases Pstl site 

PROPERTIES :primes DNA synthesis at the junction of baculovirus 
transfer vector sequences and sequences previously amplified by oli^o 
d75; introduces a Pstl recognition site for subsequent cloning vork 

TAAGGATCCC OCT CCA GTA TCG GCG GAA TTC 
Ser Ala Val Ser Ala Glu Phe 
5 
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SEQ ID N0:25 

SEQUEHGE TYP£:Sucleot:ide 
SEQUENCE LEN0TH:45 BASES 

SXRANDEDNESS : single 

TOPOLOCT: linear 

MOLECULE TYPE :syntihe tic DNA 

ORIGINAL SOURCE ORGANISK:K/A 

IMMEDIATE EXPERIMENTAL SOURCE : Oligonucleotide synthesiser; oligo dl32 
FEATURES: 

from 5 to 10 bases Pstl recognition site 

from 13 to 27 bases linker coding for five Lys residues 

from 28 to 45 bases homologous to bases 4 to 21 of BRll (SEQ ID 7) 

PROPERTIES rprimes DNA synthesis at the 5' end of BRll and introduces a 
synthetic sequence vhich codes for five lysines as well as a Pstl 
recognition site for sxabsequent cloning work 

CTGCCTGCA GTA AAG AAG AAG AAG AAG AAA ACC AAA CGT AAC ACC A 45 
Val Lys Lys Lys Lys Lys Lys Thr Lys Arg Asn Leu 
5 10 
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Claims : 



3. 



4. 



APT-NANBH viral polypeptide comprising an antigen having an 
amino acid sequence that is at least 90% homologous with the 
amino acid sequence set forth in SEQ ID-KO: 3.4.5.18.19.20.21 or 
22, or an antigenic fragment thereof. 

A PT-NAKBH viral polypeptide according to claim 1. in which the 
anlino acid sequence is at least 90% homologous with the amino 
acid sequence set forth in SEQ ID NO: 3. 4, or 5. or is an 
antigenic fragment thereof. 

A rr-NANBH viral polypeptide according to claim 2. in which the 
amino acid sequence is at least 90% homologous with the ammo 
acid sequence set forth in SEQ ID NO: 3 or 4. or is an antigenic 

fragment thereof, 

A PT-NANBH viral polypeptide according to claim 2, in which the 
amino acid sequence is at least 90% homologous with the amino 
acid sequence set forth in SEQ ID NO: 5. or is an antigenic 

fragment thereof, 

A PT-NAKBH viral polypeptide according to any one of the 
preceding claims, in which the amino acid sequence is at least 
95% homologous with the amino acid sequence set forth in the SEQ 
ID NO., or is an antigenic fragment thereof. 

A PT-NAHBH viral polypeptide according to claim 5, in which the 
amino acid sequence is at least 98% homologous with the amino 
acid sequence set forth in the SEQ ID NO. . or is an antigenic 
fragment thereof- 

APT-NANBH viral polypeptide comprising an antigen from the 
structural coding region of the viral genome and an antigen fra« 
the non- structural coding region of the viral genome. 
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8, A PT-SANBH viral polypeptide according to claim 7, in which the 
antigen from the structural coding region has an amino acid 
sequence that is at least 90% homologous with the amino acid 
sequence set forth in SEQ ID NO : 5. or an antigenic fragment 
thereof, and the antigen from the nonr structural coding region 
has an amino acid sequence that is at least 90% homologous with 
the amino acid sequence set forth in SEQ ID NO : 3 or 4, or an 
antigenic fragment thereof. 

9, A DNA sequence encoding a PT-NANBH viral polypeptide according to 
any one of claims 1 to 8, 

10, A DNA sequence according to claim 9 as set forth in SEQ ID KO: 
3. 4, 5, 18, 19. 20. 21 or 22. 

11, An expression vector containing a DNA sequence, according to 
either of claims 9 and 10 , and being capable in an • appropriate 
host of expressing the DNA sequence to produce a PT-NANBH viral 
polypeptide . 

12, An host cell transformed with an expression vector according to 
claim 11* 

13, A process for preparing PT-NANBH viral polypeptide which 
comprises cloning, or synthesising a DNA sequence encoding 
PT-NANBH viral polypeptide according to any one of claims 1 to 8, 
inserting t±e DNA sequence into an expression vector such that it 
is capable in an appropriate host of being expressed, 
transforming an host cell with the expression vector, culturing 
the transformed host cell, and isolating the viral polypeptide. 

14, A polyclonal or monoclonal antibody against a PT-NANBH viral 
polypeptide, according to any one of claims 1 to 6, 
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15. A method for the detection of PT-NANBH viral mcleic acid, which 
comprises : 

i) hybridising viral 8M present in a test sample, or cDNA 
synthesised from such SKA, with a DNA sequence corresponding 
to SEQ ID NO: 3. 4. 5, 18. 19. 20. 21 or 22. and screening 
the resulting nucleic acid hybrids to identify any PT-HASBH 
viral nucleic acid; or 

ii) synthesising cDNA from viral RNA present in a test sample, 
amplifying a preselected DNA sequence corresponding to a 
subsequence of the SEQ ID NO: 3. 4. 5. 18. 19. 20. 21 or 22, 
and Identifying the preselected DNA sequence. 

16. A test: kit for the detection of PT-NANBH viral nucleic acid, 
which comprises: 

i) a pair of oligonucleotide primers one of which corresponds 
to a portion of the nucleotide sequence of SEQ ID NO: 
3.4,5,18.19.20,21 or 22 and the other of which is located to 
the' 3' side of the first and corresponds to a portion of the 
complementary sequence, the pair defining between them a 
preselected DNA sequence; 

ii) a reverse transcriptase enzyme for the synthesis of cDNA 
from test sample RNA upstream of the primer corresponding to 
the complementary nucleotide sequence of SEQ ID NO: 
3,4,5.18.19,20.21 or 22; 

iii) an enzyme capable of amplifying the preselected DNA 

sequence; and optionally 

iv) washing solutions and reaction buffers- 
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17. A method for tha detection of PT-NANBH viral antigen or viral 
antibody, which comprises contacting a test sample with a 
FI-NANBH viral polypeptide according to any of claims 1 to 8, or 
a polyclonal or monoclonal antibody according to claim 14, and 
determining whether tiiere is ai^ antigen- antibody binding 
contained within the test sample. 

18. A test kit for the detection of FE-NANBH viral antigen or viral 
antlbo<fy, which comprises a PT-KASBH viral polypeptide according 
to any of claims 1 to 8, or a polyclonal or monoclonal antibody 
according to claim 14, and means for determining whether there is 
any antigen- antibody binding contained within the test sample. 

19. A vaccine formulation which comprises a PT-NANBH viral 
polypeptide according to any of claims 1 to 8, in association 
vith a pharmaceutlcally acceptable carrier, 

20. A method for Inducing immunity in man to PT-NAHBH. which 
comprises the administration of an effective amount of a vaccine 
formulation according to claim 19. 
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